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igure 1A : Genera! description of the polyphage principle 
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Figure 1B: General description of the polyphage principle (cont.) 
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Figure 2A 
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1 AACGCTACTA CCATTAGTAG AATTGATGCC ACCTTTTCAG CTCGCGCCCC 
TTGCGATGAT GGTAATCATC TTAACTACGG TGGAAAAGTC GAGCGCGGGG 

51 AAATGAAAAT ATAGCTAAAC AGGTTATTGA CCATTTGCGA AATGTATCTA 
TTTACTTTTA TATCGATTTG TCCAATAACT GGTAAACGCT TTACATAGAT 

101 ATGGTCAAAC TAAATCTACT CGTTCGCAGA ATTGGGAATC AACTGTTACA 
TACCAGTTTG ATTTAGATGA GCAAGCGTCT TAACCCTTAG TTGACAATGT 

151 TGGAATGAAA CTTCCAGACA CCGTACTTTA GTTG CAT ATT TAAAACATGT 
ACCTTACTTT GAAGGTCTGT GGCATGAAAT CAACGTATAA ATTTTGTACA 

2 01 TGAACTACAG CAC CAGATTC AGCAATTAAG CTCTAAGCCA TCCGCAAAAA 
ACTTGATGTC GTGGTCTAAG TCGTTAATTC GAGATTCGGT AGGCGTTTTT 

2 51 TGACCTCTTA TCAAAAGGAG CAATTAAAGG TACTGTCTAA TCCTGACCTG 

ACTGGAGAAT AGTTTTCCTC GTTAATTTCC ATGACAGATT AGGACTGGAC 

3 01 TTGGAATTTG CTTCCGGTCT GGTTCGCTTT GAGGCTCGAA TTGAAACGCG 

AACCTTAAAC GAAGGCCAGA CCAAGCGAAA CTCCGAGCTT AACTTTGCGC 

3 51 ATATTTGAAG TCTTTCGGGC TTCCTCTTAA TCTTTTTGAT GCAATTCGCT 

TATAAACTTC AGAAAGCCCG AAGGAGAATT AGAAAAACTA CGTTAAGCGA 

4 01 TTGCTTCTGA CTATAATAGA CAGGGTAAAG ACCTGATTTT TGATTTATGG 

AACGAAGACT GATATTATCT GTCCCATTTC TGGACTAAAA ACTAAATACC 

4 51 TCATTCTCGT TTTCTGAACT GTTTAAAGCA TTTGAGGGGG ATTCAATGAA 
AGTAAGAGCA AAAGACTTGA CAAATTTCGT AAACTCCCCC TAAGTTACTT 

501 TATTTATGAC GATTCCGCAG TATTGGACGC TATCCAGTCT AAACATTTTA 
ATAAATACTG CTAAGGCGTC ATAACCTGCG ATAGGTCAGA TTTGTAAAAT 

551 CAATTACCCC CTCTGGCAAA ACTTCCTTTG CAAAAGCCTC TCGCTATTTT 
GTTAATGGGG GAGACCGTTT TGAAGGAAAC GTTTTCGGAG AGCGATAAAA 

601 GGTTTCTATC GTCGTCTGGT TAATGAGGGT TATGATAGTG TTGCTCTTAC 
CCAAAGATAG CAGCAGACCA ATTACTCCCA ATACTATCAC AACGAGAATG 

651 CATGCCTCGT AATTCCTTTT GGCGTTATGT ATCTGCATTA GTTGAGTGTG 
GTACGGAGCA TTAAGGAAAA CCGCAATACA TAGACGTAAT CAACTCACAC 

701 GTATTCCTAA ATCTCAATTG ATGAATCTTT CCACCTGTAA TAATGTTGTT 
CATAAGGATT TAGAGTTAAC TACTTAGAAA GGTGGACATT ATTACAACAA 

751 CCGTTAGTTC GTTTTATTAA CGTAGATTTT TCCTCCCAAC GTCCTGACTG 
GGCAATCAAG CAAAATAATT GCATCTAAAA AGGAGGGTTG CAGGACTGAC 

801 GTATAATGAG CCAGTTCTTA AAATCGCATA AGGTAATTCA AAATGATTAA 
CATATTACTC GGTCAAGAAT TTTAGCGTAT TCCATTAAGT TTTACTAATT 
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Figure 2C 5/39 



851 AGTTGAAATT AAACCGTCTC AAGCGCAATT TACTACCCGT TCTGGTGTTT 
TCAACTTTAA TTTGG CAG AG TTCGCGTTAA ATGATGGGCA AGACCACAAA 



901 CTCGTCAGGG CAAGCCTTAT TCACTGAATG AGCAGCTTTG TTACGTTGAT 
GAGCAGTCCC GTTCGGAATA AGTGACTTAC TCGTCGAAAC AATGCAACTA 

951 TTGGGTAATG AATATCCGGT GCTTGTCAAG ATTACTCTCG ACGAAGGTCA 
AACCCATTAC TTATAGGCCA CGAACAGTTC TAATGAGAGC TGCTTCCAGT 

1001 GCCAGCGTAT GCGCCTGGTC TGTACACCGT GCATCTGTCC TCGTTCAAAG 
CGGTCGCATA CGCGGACCAG ACATGTGGCA CGTAGACAGG AGCAAGTTTC 

1051 TTGGTCAGTT CGGTTCTCTT ATGATTGACC GTCTGCGCCT CGTTCCGGCT 
AACCAGTCAA GCCAAGAGAA TACTAACTGG CAGACGCGGA GCAAGGCCGA 

1101 AAGTAACATG GAGCAGGTCG CGGATTTCGA CACAATTTAT CAGGCGATGA 
TTCATTGTAC CTCGTCCAGC GCCTAAAGCT GTGTTAAATA GTCCGCTACT 

1151 TACAAATCTC CGTTGTACTT TGTTTCGCGC TTGGTATAAT CGCTGGGGGT 
ATGTTTAGAG GCAACATGAA ACAAAGCGCG AACCATATTA GCGACCCCCA 

12 01 CAAAGATGAG TGTTTTAGTG TATTCTTTCG CCTCTTTCGT TTTAGGTTGG 
GTTTCTACTC ACAAAATCAC ATAAGAAAGC GGAGAAAGCA AAATCCAACC 

12 51 TGCCTTCGTA GTGGCATTAC GTATTTTACC CGTTTAATGG AAACTTCCTC 

ACGGAAGCAT CACCGTAATG CATAAAATGG GCAAATTACC TTTGAAGGAG 

13 01 ATGCGTAAGT CTTTAGTCCT CAAAGCCTCC GTAGCCGTTG CTACCCTCGT 

TACGCATTCA GAAATCAGGA GTTTCGGAGG CATCGGCAAC GATGGGAGCA 

13 51 TCCGATGCTG TCTTTCGCTG CTGAGGGTGA CGATCCCGCA AAAGCGGCCT 

AGGCTACGAC AGAAAGCGAC GACTCCCACT GCTAGGGCGT TTTCGCCGGA 

14 01 TTGACTCCCT GCAAGCCTCA GCGACCGAAT ATATCGGTTA TGCGTGGGCG 

AACTGAGGGA CGTTCGGAGT CGCTGGCTTA TATAGCCAAT ACGCACCCGC 

14 51 ATGGTTGTTG TCATTGTCGG CGCAACTATC GGTATCAAGC TGTTTAAGAA 
TACCAACAAC AGTAACAGCC GCGTTGATAG CCATAGTTCG ACAAATTCTT 

1501 ATTCACCTCG AAAGCAAGCT GATAAAGGAG GTTTCTCGAT CGAGACGTTN 
TAAGTGGAGC TTTCGTTCGA CTATTTCCTC CAAAGAGCTA GCTCTGCAAN 

1551 NNNGAGGTTC CAACTTTCAC CATAATGAAA TAAGATCACT ACCGGGCGTA 
NNNCTCCAAG GTTGAAAGTG GTATTACTTT ATTCTAGTGA TGGCCCGCAT 

1601 TTTTTTGAGT TATCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA 
AAAAAACTCA ATAGCTCTAA AAGTCCTCGA TTCCTTCGAT TTTACCTCTT 

1651 AAAAATCACT GGATATACCA CCGTTGATAT ATCCCAATGG CATCGTAAAG 
TTTTTAGTGA CCTATATGGT GGCAACTATA TAGGGTTACC GTAGCATTTC 
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Figure 2D 6/39 

1701 AACATTTTGA GGCATTTCAG TCAGTTGCTC AATGTACCTA TAACCAGACC 
TTGTAAAACT CCGTAAAGTC AGTCAACGAG TTACATGGAT ATTGGTCTGG 

17 51 GTTCAGCTGG ATATTACGGC CTTTTTAAAG ACCGTAAAGA AAAATAAGCA 

CAAGTCGACC TATAATGCCG GAAAAATTTC TGGCATTTCT TTTTATTCGT 

18 01 CAAGTTTTAT CCGGCCTTTA TTCACATTCT TGCCCGCCTG ATGAATGCTC 

GTTCAAAATA GGCCGGAAAT AAGTGTAAGA ACGGGCGGAC TACTTACGAG 

1851 ATCCGGAGTT CCGTATGGCA ATGAAAGACG GTGAGCTGGT GATATGGGAT 
TAGGCCTCAA GGCATACCGT TACTTTCTGC CACTCGACCA CTATACCCTA 

1901 AGTGTTCACC CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC 
TCACAAGTGG GAACAATGTG GCAAAAGGTA CTCGTTTGAC TTTGCAAAAG 

1951 ATCGCTCTGG AGTGAATACC ACGACGATTT CCGGCAGTTT CTACACATAT 
TAGCGAGACC TCACTTATGG TGCTGCTAAA GGCCGTCAAA GATGTGTATA 

2 001 ATTCGCAAGA TGTGGCGTGT TACGGTGAAA ACCTGGCCTA TTTCCCTAAA 
TAAGCGTTCT ACACCGCACA ATGCCACTTT TGGACCGGAT AAAGGGATTT 

20 51 GGGTTTATTG AGAATATGTT TTTCGTCTCA GCCAATCCCT GGGTGAGTTT 
CCCAAATAAC TCTTATACAA AAAGCAGAGT CGGTTAGGGA CCCACTCAAA 

2101 CACCAGTTTT GATTTAAACG TGGCCAATAT GGACAACTTC TTCGCCCCCG 
GTGGTCAAAA CTAAATTTGC ACCGGTTATA CCTGTTGAAG AAGCGGGGGC 

Ncol 



2151 TTTTCACCAT GGGCAAATAT TATACGCAAG GCGACAAGGT GCTGATGCCG 
AAAAGTGGTA CCCGTTTATA ATATGCGTTC CGCTGTTCCA CGACTACGGC 

2201 CTGGCGATTC AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG 
GACCGCTAAG TCCAAGTAGT ACGGCAGACA CTACCGAAGG TACAGCCGTC 

2251 AATGCTTAAT GAATTACAAC AGTACTGCGA TGAGTGGCAG GGCGGGGCGT 
TTACGAATTA CTTAATGTTG TCATGACGCT ACTCACCGTC CCGCCCCGCA 

23 01 AATTTTTTTA AGGCAGTTAT TGGTGCCCTT AAACGCCTGG TGCTACGCCT 
TTAAAAAAAT TCCGTCAATA ACCACGGGAA TTTGCGGACC ACGATGCGGA 

23 51 GAATAAGTGA TAATAAGCGG ATGAATGGCA GAAATTCGAA AGCAAATTCG 

CTTATTCACT ATTATTCGCC TACTTACCGT CTTTAAGCTT TCGTTTAAGC 

24 01 ACCCGGTCGT CGGTTCAGGG CAGGGTCGTT AAATAGCCGC TTATGTCTAT 

TGGGCCAGCA GCCAAGTCCC GTCCCAGCAA TTTATCGGCG AATACAGATA 

24 51 TGCTGGTTTA CCGGTTTATT GACTACCGGA AGCAGTGTGA CCGTGTGCTT 
ACGACCAAAT GGCCAAATAA CTGATGGCCT TCGTCACACT GGCACACGAA 

2 501 CTCAAATGCC TGAGGCCAGT TTGCTCAGGC TCTCCCCGTG GAGGTAATAA 
GAGTTTACGG ACTCCGGTCA AACGAGTCCG AGAGGGGCAC CTCCATTATT 
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2 551 TTGCTCGACC GATAAAAGCG GCTTCCTGAC AGGAGGCCGT TTTGTTTTGC 
AACGAGCTGG CTATTTTCGC CGAAGGACTG TCCTCCGGCA AAACAAAACG 

2601 AGCCCACCTC AACGCAATTA ATGTGAGTTA GCTCACTCAT TAGGCACCCC 
TCGGGTGGAG TTGCGTTAAT TACACTCAAT CGAGTGAGTA ATCCGTGGGG 

2651 AGGCTTTACA CTTTATGCTT CCGGCTCGTA TGTTGTGTGG AATTGTGAGC 
TCCGAAATGT GAAATACGAA GGCCGAGCAT ACAACACACC TTAACACTCG 

27 01 GGATAACAAT TTCACACAGG AAACAGCTAT GACCATGATT ACGAATTTCT 

CCTATTGTTA AAGTGTGTCC TTTGTCGATA CTGGTACTAA TGCTTAAAGA 

2751 AGATAACGAG GGCAAATCAT GAAAAAGACA GCTATCGCGA TTGCAGTGGC 
TCTATTGCTC CCGTTTAGTA CTTTTTCTGT CGATAGCGCT AACGTCACCG 

28 01 ACTGGCTGGT TTCGCTACCG TAGCGCAGGC CGACTACAAA GATATCGTTA 

TGACCGACCA AAGCGATGGC ATCGCGTCCG GCTGATGTTT CTATAGCAAT 

2851 TGACCCAGTC ACCGTCCTCC CTGACCGTTA CCGCTGGTGA AAAAGTTACC 
ACTGGGTCAG TGGCAGGAGG GACTGGCAAT GGCGACCACT TTTTCAATGG 

2 901 ATGTCCTGCA CCTCCTCCCA GTCCCTGTTC AACTCCGGTA AACAGAAAAA 
TACAGGACGT GGAGGAGGGT CAGGGACAAG TTGAGGCCAT TTGTCTTTTT 

2 951 CTACCTGACC TGGTATCAGC AGAAACCGGG TCAGCCACCG AAAGTTCTGA 

GATGGACTGG ACCATAGTCG TCTTTGGCCC AGTCGGTGGC TTTCAAGACT 

3 001 TCTACTGGGC TTCCACCCGT GAATCCGGTG TTCCAGACCG TTTCACCGGT 

AGATGACCCG AAGGTGGGCA CTTAGGCCAC AAGGTCTGGC AAAGTGGCCA 

3 051 TCCGGTTCCG GCACCGACTT CACCCTGACC ATCTCCTCCG TTCAGGCTGA 
AGGCCAAGGC CGTGGCTGAA GTGGGACTGG TAGAGGAGGC AAGTCCGACT 

3101 AGACCTGGCT GTTTACTACT GCCAGAACGA CTACTCCAAC CCACTGACCT 
TCTGGACCGA CAAATGATGA CGGTCTTGCT GATGAGGTTG GGTGACTGGA 

3151 TCGGTGGTGG CACCAAACTG GAACTTAAGC GCGCTGGTGG TGGAGGGTCT 
AGCCACCACC GTGGTTTGAC CTTGAATTCG CGCGACCACC ACCTCCCAGA 

BamHI 



3201 GGAGGAGGTG GGAGTGGGGG AGGTGGATCC GGCGGGGGAG GTTCAGGGGG 
CCTCCTCCAC CCTCACCCCC TCCACCTAGG CCGCCCCCTC CAAGTCCCCC 

3251 TGGCGGTAGT GGAGGGGGCG GTTCAGAAGT TCAACTAGTT GAATCCGGTG 
ACCGCCATCA CCTCCCCCGC CAAGTCTTCA AGTTGATCAA CTTAGGCCAC 

3 3 01 GTGACCTGGT TAAACCGGGT GGTTCCCTGA AACTGTCCTG CGCTGCTTCC 
CACTGGACCA ATTTGGCCCA CCAAGGGACT TTGACAGGAC GCGACGAAGG 
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Figure 2F g/39 

3 3 51 GGTTTCTCCT TCTCCTCCTA CGGTATGTCC TGGGTTCGTC AGACCCCGGA 

CCAAAGAGGA AGAGGAGGAT GCCATACAGG ACCCAAGCAG TCTGGGGCCT 

34 01 CAAACGTCTG GAATGGGTTG CTACCATCTC CAACGGTGGT GGTTACACCT 

GTTTGCAGAC CTTACCCAAC GATGGTAGAG GTTGCCACCA CCAATGTGGA 

34 51 ACTACCCGGA CTCCGTTAAA GGTCGTTTCA CCATCTCCCG TGACAACGCT 

TGATGGGCCT GAGGCAATTT CCAGCAAAGT GGTAGAGGGC ACTGTTGCGA 

PstI 



3 501 AAAAACACCC TGTACCTGCA GATGTCCTCC CTGAAATCCG AAGACTCAGC 
TTTTTGTGGG ACATGGACGT CTACAGGAGG GACTTTAGGC TTCTGAGTCG 

3 551 TATGTACTAC TGCGCTCGTC GTGAACGTTA CGACGAAAAC GGTTTCGCTT 
ATACATGATG ACGCGAGCAG CACTTGCAAT GCTGCTTTTG CCAAAGCGAA 

EcoRI 



3 601 ACTGGGGTCA GGGTACCCTG GTTAC CGTTT CAGCTTCCGG AGAATTCGAG 
TGACCCCAGT CCCATGGGAC CAATGGCAAA . GTCGAAGGCC TCTTAAGCTC 

Aval 



3 651 GCCTCGGGGG CCGAGGGCGG CGGTTCTGGT TCCGGTGATT TTGATTATGA 

CGGAGCCCCC GGCTCCCGCC GCCAAGACCA AGGCCACTAA AACTAATACT 

3 701 AAAAATGGCA AACGCTAATA AGGGGGCTAT GACCGAAAAT GCCGATGAAA 

TTTTTACCGT TTGCGATTAT TCCCCCGATA CTGGCTTTTA CGGCTACTTT 

3751 ACGCGCTACA GTCTGACGCT AAAGGCAAAC TTGATTCTGT CGCTACTGAT 

TGCGCGATGT CAGACTGCGA TTTCCGTTTG AACTAAGACA GCGATGACTA 

Clal 



3801 TACGGTGCTG CTATCGATGG TTTCATTGGT GACGTTTCCG GCCTTGCTAA 
ATGCCACGAC GATAGCTACC AAAGTAACCA CTGCAAAGGC CGGAACGATT 

3 8 51 TGGTAATGGT GCTACTGGTG ATTTTGCTGG CTCTAATTCC CAAATGGCTC 
ACCATTACCA CGATGACCAC TAAAACGACC GAGATTAAGG GTTTACCGAG 

3 901 AAGTCGGTGA CGGTGATAAT TCACCTTTAA TGAATAATTT CCGTCAATAT 
TTCAGCCACT GCCACTATTA AGTGGAAATT ACTTATTAAA GGCAGTTATA 

3 951 TTACCTTCCC TCCCTCAATC GGTTGAATGT CGCCCTTTTG TCTTTGGCGC 

AATGGAAGGG AGGGAGTTAG CCAACTTACA GCGGGAAAAC AGAAACCGCG 

4 001 TGGTAAACCA TATGAATTTT CTATTGATTG TGACAAAATA AACTTATTCC 

ACCATTTGGT ATACTTAAAA GATAACTAAC ACTGTTTTAT TTGAATAAGG 

4 0 51 GTGGTGTCTT TGCGTTTCTT TTATATGTTG CCACGTTTAT GTATGTATTT 
CACCACAGAA ACGCAAAGAA AATATACAAC GGTGGAAATA CATACATAAA 
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Figure 2G 

Hindlll 



4101 TCTACGTTTG CTAACATACT GCGTAATAAG GAGTCTTGAT AAGCTTCGAG 
AGATGCAAAC GATTGTATGA CGCATTATTC CTCAGAACTA TTCGAAGCTC 

4151 AAATTCACCT CGAAAGCAAG CTGATAAACC GATACAATTA AAGGCTCCTT 
TTTAAGTGGA GCTTTCGTTC GACTATTTGG CTATGTTAAT TTCCGAGGAA 

EcoRI 



4 2 01 TTGGAGCCTT TTTTTTTGGA GAATTCAATC ATGCCAGTTC TTTTGGGTAT 
AACCTCGGAA AAAAAAACCT CTTAAGTTAG TACGGTCAAG AAAACCCATA 

4 2 51 TCCGTTATTA TTGCGTTTCC TCGGTTTCCT TCTGGTAACT TTGTTCGGCT 
AGGCAATAAT AACGCAAAGG AGCCAAAGGA AGACCATTGA AACAAGCCGA 

43 01 ATCTGCTTAC TTTCCTTAAA AAGGGCTTCG GTAAGATAGC TATTGCTATT 
TAGACGAATG AAAGGAATTT TTCCCGAAGC CATTCTATCG ATAACGATAA 

43 51 TCATTGTTTC TTGCTCTTAT TATTGGGCTT AACTCAATTC TTGTGGGTTA 

AGTAACAAAG AACGAGAATA ATAACCCGAA TTGAGTTAAG AAC AC CCAAT 

44 01 TCTCTCTGAT ATTAGCGCAC AATTACCCTC TGATTTTGTT CAGGGCGTTC 

AGAGAGACTA TAATCGCGTG TTAATGGGAG ACTAAAACAA GTCCCGCAAG 

44 51 AGTTAATTCT CCCGTCTAAT GCGCTTCCCT GTTTTTATGT TATTCTCTCT 
TCAATTAAGA GGGCAGATTA CGCGAAGGGA CAAAAATACA ATAAGAGAGA 

4 501 GTAAAGGCTG CTATTTTCAT TTTTGACGTT AAACAAAAAA TCGTTTCTTA 
CATTTCCGAC GATAAAAGTA AAAACTGCAA TTTGTTTTTT AG C AAAGAAT 

4 551 TTTGGATTGG GATAAATAAA TATGGCTGTT TATTTTGTAA CTGGCAAATT 
AAACCTAACC CTATTTATTT AT AC C G AC AA ATAAAACATT GACCGTTTAA 

4 601 AGGCTCTGGA AAGACGCTCG TTAGCGTTGG TAAGATTCAG GATAAAATTG 
TCCGAGACCT TTCTGCGAGC AATCGCAACC ATTCTAAGTC CTATTTTAAC 

4 651 TAGCTGGGTG CAAAATAGCA ACTAATCTTG ATTTAAGGCT TCAAAACCTC 
ATCGACCCAC GTTTTATCGT TGATTAGAAC TAAATTCCGA AGTTTTGGAG 

4 701 CCGCAAGTCG GGAGGTTCGC TAAAACGCCT CGCGTTCTTA GAATACCGGA 
GGCGTTCAGC CCTCCAAGCG ATTTTGCGGA GCGCAAGAAT CTTATGGCCT 

4751 TAAGCCTTCT ATTTCTGATT TGCTTGCTAT TGGTCGTGGT AATGATTCCT 
ATTCGGAAGA TAAAGACTAA ACGAACGATA ACCAGCACCA TTACTAAGGA 

4 801 ACGACGAAAA TAAAAACGGT TTGCTTGTTC TTGATGAATG CGGTACTTGG 
TGCTGCTTTT ATTTTTGCCA AACGAACAAG AACTACTTAC GCCATGAACC 

4 851 TTTAATACCC GTTCATGGAA TGACAAGGAA AGACAGCCGA TTATTGATTG 
AAATTATGGG CAAGTACCTT ACTGTTCCTT TCTGTCGGCT AATAACTAAC 
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Figure 2H 1Q/39 



4 901 GTTTCTTCAT GCTCGTAAAT TGGGATGGGA TATTATTTTT CTTGTTCAGG 
CAAAGAAGTA CG AG CATTTA ACCCTACCCT ATAATAAAAA GAACAAGTCC 



4 951 ATTTATCTAT TGTTGATAAA CAGGCGCGTT CTGCATTAGC TGAACACGTT 
TAAATAGATA ACAACTATTT GTCCGCGCAA GACGTAATCG ACTTGTGCAA 

5001 GTTTATTGTC GCCGTCTGGA CAGAATTACT TTACCCTTTG TCGGCACTTT 
CAAATAACAG CGGCAGACCT GTCTTAATGA AATGGGAAAC AGCCGTGAAA 

5051 ATATTCTCTT GTTACTGGCT CAAAAATGCC TCTGCCTAAA TTACATGTTG 
TATAAGAGAA CAATGACCGA GTTTTTACGG AGACGGATTT AATGTACAAC 

5101 GTGTTGTTAA ATATGGTGAT TCTCAATTAA GCCCTACTGT TGAGCGTTGG 
CACAACAATT TATACCACTA AGAGTTAATT CGGGATGACA ACTCGCAACC 

5151 CTTTATACTG GTAAGAATTT ATATAACGCA TATGACACTA AACAGGCTTT 
GAAATATGAC CATTC TTAAA TATATTGCGT ATACTGTGAT TTGTCCGAAA 

52 01 TTCCAGTAAT TATGATTCAG GTGTTTATTC ATATTTAACC CCTTATTTAT 

AAGGTCATTA ATACTAAGTC CACAAATAAG TATAAATTGG GGAATAAATA 

5251 CACACGGTCG GTATTTCAAA CCATTAAATT TAGGTCAGAA GATGAAATTA 
GTGTGCCAGC CATAAAGTTT GGTAATTTAA ATCCAGTCTT CTACTTTAAT 

53 01 ACTAAAATAT ATTTGAAAAA GTTTTCTCGC GTTCTTTGTC TTGCGATAGG 

TGATTTTATA TAAACTTTTT CAAAAGAGCG CAAGAAACAG AACGCTATCC 

53 51 ATTTGCATCA GCATTTACAT ATAGTTATAT AACCCAACCT AAGCCGGAGG 

TAAACGTAGT CGTAAATGTA TATCAATATA TTGGGTTGGA TTCGGCCTCC 

54 01 TTAAAAAGGT AGTCTCTCAG ACCTATGATT TTGATAAATT CACTATTGAC 

AATTTTTCCA TCAGAGAGTC TGGATACTAA AACTATTTAA GTGATAACTG 

54 51 TCTTCTCAGC GTCTTAATCT AAGCTATCGC TATGTTTTCA AGGATTCTAA 
AGAAGAGTCG CAGAATTAGA TTCGATAGCG ATACAAAAGT TCCTAAGATT 

5501 GGGAAAATTA ATTAATAGCG ACGATTTACA GAAGCAAGGT TATTCCATCA 
CCCTTTTAAT TAATTATCGC TGCTAAATGT CTTCGTTCCA ATAAGGTAGT 

5551 CATATATTGA TTTATGTACT GTTTCAATTA AAAAAGGTAA TTCAAATGAA 
GTATATAACT AAATACATGA CAAAGTTAAT TTTTTCCATT AAGTTTACTT 

56 01 ATTGTTAAAT GTAATTAATT TTGTTTTCTT GATGTTTGTT TCATCATCTT 

TAACAATTTA CATTAATTAA AACAAAAGAA CTACAAACAA AGTAGTAGAA 

5651 CTTTTGCTCA AGTAATTGAA ATGAATAATT CGCCTCTGCG CGATTTCGTG 
GAAAACGAGT TCATTAACTT TACTTATTAA GCGGAGACGC GCTAAAGCAC 

57 01 ACTTGGTATT CAAAGCAAAC AGGTGAATCT GTTATTGTCT CACCTGATGT 

TGAACCATAA GTTTCGTTTG TCCACTTAGA CAATAACAGA GTGGACTACA 
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Figure 21 11/39 

57 51 TAAAGGTACA GTGACTGTAT ATTCCTCTGA CGTTAAGCCT GAAAATTTAC 

ATTTCCATGT CACTGACATA TAAGGAGACT GCAATTCGGA CTTTTAAATG 

5801 GCAATTTCTT TATCTCTGTT TTACGTGCTA ATAATTTTGA TATGGTTGGC 
CGTTAAAGAA ATAGAGACAA AATGCACGAT TATTAAAACT ATACCAACCG 

58 51 TCAATTCCTT CCATAATTCA GAAATATAAC CCAAATAGTC AGGATTATAT 

AGTTAAGGAA GGTATTAAGT CTTTATATTG GGTTTATCAG TCCTAATATA 

5901 TGATGAATTG CCATCATCTG ATATTCAGGA ATATGATGAT AATTCCGCTC 
ACTACTTAAC GGTAGTAGAC TATAAGTCCT TATACTACTA TTAAGGCGAG 

59 51 CTTCTGGTGG TTTCTTTGTT CCGCAAAATG ATAATGTTAC TCAAACATTT 

GAAGACCACC AAAGAAACAA GGCGTTTTAC TATTACAATG AGTTTGTAAA 

6001 AAAATTAATA ACGTTCGCGC AAAGGATTTA ATAAGGGTTG TAGAATTGTT 
TTTTAATTAT TGCAAGCGCG TTTCCTAAAT TATTCCCAAC ATCTTAACAA 

60 51 TGTTAAATCT AATACATCTA AATCCTCAAA TGTATTATCT GTTGATGGTT 

ACAATTTAGA TTATGTAGAT TTAGGAGTTT ACATAATAGA CAACTACCAA 

6101 CTAACTTATT AGTAGTTAGC GCCCCTAAAG ATATTTTAGA TAACCTTCCG 
GATTGAATAA TCATCAATCG CGGGGATTTC TATAAAATCT ATTGGAAGGC 

6151 CAATTTCTTT CTACTGTTGA TTTGCCAACT GACCAGATAT TGATTGAAGG 
GTTAAAGAAA GATGACAACT AAACGGTTGA CTGGTCTATA ACTAACTTCC 

62 01 ATTAATTTTC GAGGTTCAGC AAGGTGATGC TTTAGATTTT TCCTTTGCTG 
TAATTAAAAG CTCCAAGTCG TTCCACTACG AAATCTAAAA AGGAAACGAC 

62 51 CTGGCTCTCA GCGCGGCACT GTTGCTGGTG GTGTTAATAC TGACCGTCTA 

GACCGAGAGT CGCGCCGTGA CAACGACCAC CACAATTATG ACTGGCAGAT 

63 01 ACCTCTGTTT TATCTTCTGC GGGTGGTTCG TTCGGTATTT TTAACGGCGA 

TGGAGACAAA ATAGAAGACG CCCACCAAGC AAG C CAT AAA AATTGCCGCT 

63 51 TGTTTTAGGG CTATCAGTTC GCGCATTAAA GACTAATAGC CATTCAAAAA 

ACAAAATCCC GATAGTCAAG CGCGTAATTT CTGATTATCG GTAAGTTTTT 

64 01 TATTGTCTGT GCCTCGTATT CTTACGCTTT CAGGTCAGAA GGGTTCTATT 

ATAACAGACA CGGAGCATAA GAATGCGAAA GTCCAGTCTT CCCAAGATAA 

64 51 TCTGTTGGCC AGAATGTCCC TTTTATTACT GGTCGTGTAA CTGGTGAATC 
AGACAACCGG TCTTACAGGG AAAATAATGA CCAGCACATT GACCACTTAG 

6501 TGCCAATGTA AATAATCCAT TTCAGACGGT TGAGCGTCAA AATGTTGGTA 
ACGGTTACAT TTATTAGGTA AAGTCTGCCA ACTCGCAGTT TTACAACCAT 

6 551 TTTCTATGAG TGTTTTTCCC GTTGCAATGG CTGGCGGTAA TATTGTTTTA 
AAAGATACTC ACAAAAAGGG CAACGTTACC GACCGCCATT ATAACAAAAT 
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6601 GATATAACCA GTAAGGCCGA TAGTTTGAGT TCTTCTACTC AGGCAAGTGA 
CTATATTGGT CATTCCGGCT ATCAAACTCA AGAAGATGAG TCCGTTCACT 



6 6 51 TGTTATTACT AATCAAAGAA GTATTGCGAC AACGGTTAAT TTGCGTGATG 
ACAATAATGA TTAGTTTCTT CATAACGCTG TTGCCAATTA AACGCACTAC 

67 01 GTCAGACTCT TTTGCTCGGT GGCCTCACTG ATTACAAAAA CACTTCTCAA 
CAGTCTGAGA AAACGAGCCA C CGGAGTG AC TAATGTTTTT GTGAAGAGTT 

67 51 GATTCTGGTG TGCCGTTCCT GTCTAAAATC CCTTTAATCG GCCTCCTGTT 
CTAAGACCAC ACGGCAAGGA CAGATTTTAG GGAAATT AG C CGGAGGACAA 



68 01 TAGCTCCCGT TCTGATTCTA ACGAGGAAAG CACGTTGTAC GTGCTCGTCA 
ATCGAGGGCA AGACTAAGAT TGCTCCTTTC GTGCAACATG CACGAGCAGT 

6851 AAGCAACCAT AGTACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT 
TTCGTTGGTA TCATGCGCGG GACATCGCCG CGTAATTCGC GCCGCCCACA 

6 901 GGTGGTTACG CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG 
CCACCAATGC GCGTCGCACT GGCGATGTGA ACGGTCGCGG GATCGCGGGC 

6 951 CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG CCACGTTCTC CGGCTTTCCC 
GAGGAAAGCG AAAGAAGGGA AGGAAAGAG C GGTGCAAGAG GCCGAAAGGG 



BamHI 



70 01 CGTCAAGCTC TAAATCGGGG GATCCCTTTA GGGTTCCGAT TTAGTGCTTT 
GCAGTTCGAG ATTTAGCCCC CTAGGGAAAT CCCAAGGCTA AATCACGAAA 

7051 ACGGCACCTC GACCTCCAAA AACTTGATTT GGGTGATGGT TCACGTAGTG 
TGCCGTGGAG CTGGAGGTTT TTGAACTAAA CCCACTACCA AGTGCATCAC 

7101 GGCCATCGCC CTGATAGACG GTTTTTCGCC CTTTGACGTT GGAGTCCACG 
CCGGTAGCGG GACTATCTGC CAAAAAGCGG GAAACTGCAA CCTCAGGTGC 

7151 TTCTTTAATA GTGGACTCTT GTTCCAAACT GGAACAACAC TCACAACTAA 
AAGAAATTAT CACCTGAGAA CAAGGTTTGA CCTTGTTGTG AGTGTTGATT 

72 01 CTCGGCCTAT TCTTTTGATT TATAAGGATT TTTGTCATTT TCTGCTTACT 
GAGCCGGATA AGAAAACTAA ATATTCCTAA AAACAGTAAA AGACGAATGA 

72 51 GGTTAAAAAA TAAGCTGATT TAACAAATAT TTAACGCGAA ATTTAACAAA 

CCAATTTTTT ATTCGACTAA ATTGTTTATA AATTGCGCTT TAAATTGTTT 

73 01 ACATTAACGT TTACAATTTA AATATTTGCT TATACAATCA TCCTGTTTTT 

TGTAATTG C A AATGTTAAAT TTATAAACGA ATATGTTAGT AG G AC AAAAA 

73 51 GGGGCTTTTC TGATTATCAA CCGGGGTACA TATGATTGAC ATGCTAGTTT 
CCCCGAAAAG ACTAATAGTT GGCCCCATGT ATACTAACTG TACGATCAAA 
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74 01 TACGATTACC GTTCATCGAT TCTCTTGTTT GCTCCAGACT TTCAGGTAAT 
ATGCTAATGG CAAGTAGCTA AGAGAACAAA CGAGGTCTGA AAGTCCATTA 

74 51 GACCTGATAG CCTTTGTAGA CCTCTCAAAA ATAGCTACCC TCTCCGGCAT 
CTGGACTATC GGAAACATCT GGAGAGTTTT TATCGATGGG AGAGGCCGTA 

7501 GAATTTATCA GCTAGAACGG TTGAATATCA TATTGACGGT GATTTGACTG 
CTTAAATAGT CGATCTTGCC AACTTATAGT ATAACTGCCA CTAAACTGAC 

7 551 TCTCCGGCCT TTCTCACCCG TTTGAATCTT TGCCTACTCA TTACTCCGGC 
AGAGGCCGGA AAGAGTGGGC AAACTTAGAA ACGGATGAGT AATGAGGCCG 

7601 ATTGCATTTA AAATATATGA GGGTTCTAAA AATTTTTATC CCTGCGTTGA 
TAACGTAAAT TTTATATACT CCCAAGATTT TTAAAAATAG GGACGCAACT 

7651 AATTAAGGCT TCACCAGCAA AAGTATTACA GGGTCATAAT GTTTTTGGTA 
TTAATTCCGA AGTGGTCGTT TTCATAATGT CCCAGTATTA CAAAAACCAT 

7701 CAACCGATTT AGCTTTATGC TCTGAGGCTT TATTGCTTAA TTTTGCTAAC 
GTTGGCTAAA TCGAAATACG AGACTCCGAA ATAACGAATT AAAACGATTG 

77 51 TCTCTGCCTT GCTTGTACGA TTTATTGGAT GTT 
AGAGACGGAA CGAACATGCT AAATAACCTA CAA 
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1 AACGCTACTA C C ATT AGTAG AATTGATGCC ACCTTTTCAG CTCGCGCCCC 
TTGCGATGAT GGTAATCATC TTAACTACGG TGGAAAAGTC GAGCGCGGGG 

51 AAATGAAAAT ATAGCTAAAC AGGTTATTGA CCATTTGCGA AATGTATCTA 
TTTACTTTTA TATCGATTTG TCCAATAACT GGTAAACGCT TTACATAGAT 

101 ATGGTCAAAC TAAATCTACT CGTTCGCAGA ATTGGGAATC AACTGTTACA 
TACCAGTTTG ATTTAGATGA GCAAGCGTCT TAACCCTTAG TTGACAATGT 

151 TGGAATGAAA CTTCCAGACA CCGTACTTTA GTTGCATATT TAAAACATGT 
ACCTTACTTT GAAGGTCTGT GGCATGAAAT CAACGTATAA ATTTTGTACA 

2 01 TGAACTACAG CACCAGATTC AGCAATTAAG CTCTAAGCCA TCCGCAAAAA 
ACTTGATGTC GTGGTCTAAG TCGTTAATTC GAGATTCGGT AGGCGTTTTT 

2 51 TGACCTCTTA TCAAAAGGAG CAATTAAAGG TACTGTCTAA TCCTGACCTG 

ACTGGAGAAT AGTTTTCCTC GTTAATTTCC ATGACAGATT AGGACTGGAC 

3 01 TTGGAATTTG CTTCCGGTCT GGTTCGCTTT GAGGCTCGAA TTGAAACGCG 

AACCTTAAAC GAAGGCCAGA CCAAGCGAAA CTCCGAGCTT AACTTTGCGC 

3 51 ATATTTGAAG TCTTTCGGGC TTCCTCTTAA TCTTTTTGAT GCAATTCGCT 

TATAAACTTC AGAAAGCCCG AAGGAGAATT AGAAAAACTA CGTTAAGCGA 

4 01 TTGCTTCTGA C T AT AAT AG A CAGGGTAAAG ACCTGATTTT TGATTTATGG 

AACGAAGACT GATATTATCT GTCCCATTTC TGGACTAAAA ACTAAATACC 

4 51 TCATTCTCGT TTTCTGAACT GTTTAAAGCA TTTGAGGGGG ATTCAATGAA 
AG TAAG AG C A AAAGACTTGA CAAATTTCGT AAACTCCCCC TAAGTTACTT 

501 TATTTATGAC GATTCCGCAG TATTGGACGC TATCCAGTCT AAACATTTTA 
ATAAATACTG CTAAGGCGTC ATAACCTGCG ATAGGTCAGA TTTGTAAAAT 

551 CAATTACCCC CTCTGGCAAA ACTTCCTTTG CAAAAGCCTC TCGCTATTTT 
GTTAATGGGG GAGACCGTTT TGAAGGAAAC GTTTTCGGAG AGCGATAAAA 

601 GGTTTCTATC GTCGTCTGGT TAATGAGGGT TATGATAGTG TTGCTCTTAC 
C C AAAG AT AG CAGCAGACCA ATTACTCCCA ATACTATCAC AACGAGAATG 

651 CATGCCTCGT AATTCCTTTT GGCGTTATGT ATCTGCATTA GTTGAGTGTG 
GTACGGAGCA TTAAGGAAAA CCGCAATACA TAGACGTAAT CAACTCACAC 

7 01 GTATTCCTAA ATCTCAATTG ATGAATCTTT CCACCTGTAA TAATGTTGTT 
CATAAGGATT TAGAGTTAAC TACTTAGAAA GGTGGACATT ATTACAACAA 

751 CCGTTAGTTC GTTTTATTAA CGTAGATTTT TCCTCCCAAC GTCCTGACTG 
GGCAATCAAG CAAAATAATT GCATCTAAAA AGGAGGGTTG CAGGACTGAC 



801 



GTATAATGAG CCAGTTCTTA AAATCGCATA AGGTAATTCA AAATGATTAA 
CATATTACTC GGTCAAGAAT TTTAGCGTAT TCCATTAAGT TTTACTAATT 
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851 AGTTGAAATT AAACCGTCTC AAGCGCAATT TACTACCCGT TCTGGTGTTT 
TCAACTTTAA TTTGGCAGAG TTCGCGTTAA ATGATGGGCA AGAC C AC AAA 

901 CTCGTCAGGG CAAGCCTTAT TCACTGAATG AGCAGCTTTG TTACGTTGAT 
GAGCAGTCCC GTTCGGAATA AGTGACTTAC TCGTCGAAAC AATGCAACTA 

951 TTGGGTAATG AATATCCGGT GCTTGTCAAG ATTACTCTCG ACGAAGGTCA 
AACCCATTAC TTATAGGCCA CGAACAGTTC T AATG AG AG C TGCTTCCAGT 

1001 GCCAGCGTAT GCGCCTGGTC TGTACACCGT GCATCTGTCC TCGTTCAAAG 
CGGTCGCATA CGCGGACCAG ACATGTGGCA CG TAG AC AGG AGCAAGTTTC 

1051 TTGGTCAGTT CGGTTCTCTT ATGATTGACC GTCTGCGCCT CGTTCCGGCT 
AACCAGTCAA GCCAAGAGAA TACTAACTGG CAGACGCGGA GCAAGGCCGA 

1101 AAGTAACATG GAGCAGGTCG CGGATTTCGA CACAATTTAT CAGGCGATGA 
TTCATTGTAC CTCGTCCAGC GCCTAAAGCT GTGTTAAATA GTCCGCTACT 

1151 TACAAATCTC CGTTGTACTT TGTTTCGCGC TTGGTATAAT CGCTGGGGGT 
ATGTTTAGAG GCAACATGAA ACAAAGCGCG AACCATATTA GCGACCCCCA 

12 01 CAAAGATGAG TGTTTTAGTG TATTCTTTCG CCTCTTTCGT TTTAGGTTGG 
GTTTCTACTC ACAAAATCAC ATAAGAAAGC GGAGAAAGCA AAATCCAACC 

12 51 TGCCTTCGTA GTGGCATTAC GTATTTTACC CGTTTAATGG AAACTTCCTC 

ACGGAAGCAT CACCGTAATG CATAAAATGG GC AAATTAC C TTTGAAGGAG 

13 01 ATGCGTAAGT CTTTAGTCCT CAAAGCCTCC GTAGCCGTTG CTACCCTCGT 

TACGCATTCA GAAATCAGGA GTTTCGGAGG CATCGGCAAC GATGGGAGCA 

13 51 TCCGATGCTG TCTTTCGCTG CTGAGGGTGA CGATCCCGCA AAAGCGGCCT 

AGGCTACGAC AGAAAGCGAC GACTCCCACT GCTAGGGCGT TTTCGCCGGA 

14 01 TTGACTCCCT GCAAGCCTCA GCGACCGAAT ATATCGGTTA TGCGTGGGCG 

AACTGAGGGA CGTTCGGAGT CGCTGGCTTA TATAGCCAAT ACGCACCCGC 

14 51 ATGGTTGTTG TCATTGTCGG CGCAACTATC GGTATCAAGC TGTTTAAGAA 
TACCAACAAC AGTAACAGCC GCGTTGATAG CCATAGTTCG ACAAATTCTT 

1501 ATTCACCTCG AAAGCAAGCT GATAAAGGAG GTTTCTCGAT CGAGACGTTN 
TAAGTGGAGC TTTCGTTCGA CTATTTCCTC CAAAGAGCTA GCTCTGCAAN 

1551 NNNGAGGTTC CAACTTTCAC CATAATGAAA TAAGATCACT ACCGGGCGTA 
NNNCTCCAAG GTTGAAAGTG GTATTACTTT ATTCTAGTGA TGGCCCGCAT 

1601 TTTTTTGAGT TATCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA 
AAAAAACTCA ATAGCTCTAA AAGTCCTCGA TTCCTTCGAT TTTACCTCTT 

1651 AAAAATCACT GG ATATACC A CCGTTGATAT ATCCCAATGG CATCGTAAAG 
TTTTTAGTGA CCTATATGGT GGCAACTATA TAGGGTTACC GT AG C ATTTC 
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17 01 AACATTTTGA GGCATTTCAG TCAGTTGCTC AATGTACCTA TAACCAGACC 

TTGTAAAACT CCGTAAAGTC AGTCAACGAG TTACATGGAT ATTGGTCTGG 

1751 GTTCAGCTGG ATATTACGGC CTTTTTAAAG ACCGTAAAGA AAAATAAGCA 
CAAGTCGACC TATAATGCCG G AAAAATTT C TGGCATTTCT TTTTATTCGT 

18 01 CAAGTTTTAT CCGGCCTTTA TTCACATTCT TGCCCGCCTG ATGAATGCTC 

GTTCAAAATA GGCCGGAAAT AAGTGTAAGA ACGGGCGGAC TACTTACGAG 

18 51 ATCCGGAGTT CCGTATGGCA ATGAAAGACG GTGAGCTGGT GATATGGGAT 
TAGGCCTCAA GGCATACCGT TACTTTCTGC CACTCGACCA C TAT AC C C T A 

1901 AGTGTTCACC CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC 
TCACAAGTGG GAACAATGTG GCAAAAGGTA CTCGTTTGAC TTTGCAAAAG 

1951 ATCGCTCTGG AGTGAATACC ACGACGATTT CCGGCAGTTT CTACACATAT 
TAGCGAGACC TCACTTATGG TGCTGCTAAA GGCCGTCAAA GATGTGTATA 

2 001 ATTCGCAAGA TGTGGCGTGT TACGGTGAAA ACCTGGCCTA TTTCCCTAAA 
TAAGCGTTCT ACACCGCACA ATGCCACTTT TGGACCGGAT AAAGGGATTT 

2 0 51 GGGTTTATTG AGAATATGTT TTTCGTCTCA GCCAATCCCT GGGTGAGTTT 
CCCAAATAAC TCTTATACAA AAAGCAGAGT CGGTTAGGGA CCCACTCAAA 

2101 CACCAGTTTT GATTTAAACG TAG C C AAT AT GGACAACTTC TTCGCCCCCG 
GTGGTCAAAA CTAAATTTGC ATCGGTTATA CCTGTTGAAG AAGCGGGGGC 

2151 TTTTCACTAT GGGCAAATAT TATACGCAAG GCGACAAGGT GCTGATGCCG 
AAAAGTGATA CCCGTTTATA ATATGCGTTC CGCTGTTCCA CGACTACGGC 

22 01 CTGGCGATTC AGGTTCATCA TGCCGTTTGT GATGGCTTCC ATGTCGGCAG 

GACCGCTAAG TCCAAGTAGT ACGG CAAACA CTACCGAAGG TACAGCCGTC 

2 251 AATGCTTAAT GAATTACAAC AGTACTGCGA TGAGTGGCAG GGCGGGGCGT 
TTACGAATTA CTTAATGTTG TCATGACGCT ACTCACCGTC CCGCCCCGCA 

2 3 01 AATTTTTTTA AGGCAGTTAT TGGTGCCCTT AAACGCCTGG TGCTAGCCTG 
TTAAAAAAAT TCCGTCAATA ACCACGGGAA TTTGCGGACC ACGATCGGAC 

23 51 AGGCCAGTTT GCTCAGGCTC TCCCCGTGGA GGTAATAATT GCTCGACCGA 

TCCGGTCAAA CGAGTCCGAG AGGGGCACCT CCATTATTAA CGAGCTGGCT 

24 01 TAAAAGCGGC TTCCTGACAG GAGGCCGTTT TGTTTTGCAG CCCACCTCAA 

ATTTTCGCCG AAGGACTGTC CTCCGGCAAA ACAAAACGTC GGGTGGAGTT 

24 51 CGCAATTAAT GTGAGTT AG C TCACTCATTA GGCACCCCAG GCTTTACACT 
GCGTTAATTA CACTCAATCG AGTGAGTAAT CCGTGGGGTC CGAAATGTGA 

2 501 TTATGCTTCC GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT 
AATACGAAGG CCGAGCATAC AACACACCTT AACACTCGCC TATTGTTAAA 
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2 551 CACACAGGAA ACAGCTATGA C CATGATT AC GAATTTCTAG ATAACGAGGG 
GTGTGTCCTT TGTCGATACT GGTACTAATG CTTAAAGATC TATTGCTCCC 

2 601 CAAAAAATGA AAAAGACAGC T ATCG CGATT GCAGTGGCAC TGGCTGGTTT 
GTTTTTTACT TTTTCTGTCG ATAGCGCTAA CGTCACCGTG ACCGACCAAA 

26 51 CGCTACCGTA GCGCAGGCCG ACTACAAAGA TGTCGACGCC GGTGGTCGGA 
GCGATGGCAT CGCGTCCGGC TGATGTTTCT ACAGCTGCGG CCACCAGCCT 

2701 TCGCCCGGCT AGAGGAAAAA GTGAAAACCT TGAAAGCGCA AAACTCCGAG 
AGCGGGCCGA TCTCCTTTTT CACTTTTGGA ACTTTCGCGT TTTGAGGCTC 

2751 CTGGCGTCCA CGGCCAACAT GCTCAGGGAA CAGGTGGCAC AG CTTAAACA 
GACCGCAGGT GCCGGTTGTA CGAGTCCCTT GTCCACCGTG TCGAATTTGT 

EcoRI 



2 801 GAAAGTCATG AACCACGGTG GTGCCGAATT CAATGCTGGC GGCGGCTCTG 

CTTTCAGTAC TTGGTGCCAC CACGGCTTAA GTTACGACCG CCGCCGAGAC 

2851 GTGGTGGTTC TGGTGGCGGC TCTGAGGGTG GTGGCTCTGA GGGTGGCGGT 

CACCACCAAG ACCACCGCCG AGACTCCCAC CACCGAGACT CCCACCGCCA 

2 901 TCTGAGGGTG GCGGCTCTGA GGGAGGCGGT TCCGGTGGTG GCTCTGGTTC 

AGACTCCCAC CGCCGAGACT CCCTCCGCCA AGGCCACCAC CGAGACCAAG 

2 951 CGGTGATTTT GATTATGAAA AGATGGCAAA CGCTAATAAG GGGGCTATGA 

GCCACTAAAA CTAATACTTT TCTACCGTTT G CGATT ATT C CCCCGATACT 

3 001 CCGAAAATGC CGATGAAAAC GCGCTACAGT CTGACGCTAA AGGCAAACTT 

GGCTTTTACG GCTACTTTTG CGCGATGTCA GACTGCGATT TCCGTTTGAA 

Clal 



3 051 GATTCTGTCG CTACTGATTA CGGTGCTGCT ATCGATGGTT TCATTGGTGA 
CTAAGACAGC GATGACTAAT GCCACGACGA TAGCTACCAA AGTAACCACT 

3101 CGTTTCCGGC CTTGCTAATG GTAATGGTGC TACTGGTGAT TTTGCTGGCT 
GCAAAGGCCG GAACGATTAC CATTACCACG ATGACCACTA AAACGACCGA 

3151 CTAATTCCCA AATGGCTCAA GTCGGTGACG GTGATAATTC AC CTTTAATG 
GATTAAGGGT TTACCGAGTT CAGCCACTGC CACTATTAAG TGGAAATTAC 

32 01 AATAATTTCC GTCAATATTT ACCTTCCCTC CCTCAATCGG TTGAATGTCG 
TTATTAAAGG CAGTTATAAA TGGAAGGGAG GGAGTTAGCC AACTTACAGC 

32 51 CCCTTTTGTC TTTAGCGCTG GTAAACCATA TGAATTTTCT ATTGATTGTG 
GGGAAAACAG AAATCGCGAC CATTTGGTAT ACTTAAAAGA TAACTAACAC 

3301 ACAAAATAAA CTTATTCCGT GGTGTCTTTG CGTTTCTTTT ATATGTTGCC 
TGTTTTATTT GAATAAGGCA CCACAGAAAC GCAAAGAAAA TATACAACGG 
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33 51 ACCTTTATGT ATGTATTTTC TACGTTTGCT AACATACTGC GTAATAAGGA 
TGGAAATACA TACATAAAAG ATG CAAACG A TTGTATGACG CATTATTCCT 



Hindi I I 



3 4 01 GTCTTGATAA GCTTCGAGAA ATTCACCTCG AAAGCAAGCT GATAAACCGA 
CAGAACTATT CGAAGCTCTT TAAGTGGAGC TTTCGTTCGA CTATTTGGCT 

34 51 TACAATTAAA GGCTCCTTTT GGAGCCTTTT TTTTTGGAGA ATTAATTCAA 
ATGTTAATTT CCGAGGAAAA CCTCGGAAAA AAAAACCTCT TAATTAAGTT 

3 501 TCATGCCAGT TCTTTTGGGT ATTCCGTTAT TATTGCGTTT CCTCGGTTTC 
AGTACGGTCA AGAAAACCCA TAAGGCAATA AT AACG C AAA GGAGCCAAAG 

3 551 CTTCTGGTAA CTTTGTTCGG CTATCTGCTT ACTTTCCTTA AAAAGGGCTT 
GAAGACCATT GAAACAAGCC GATAGACGAA TGAAAGGAAT TTTTCCCGAA 

3601 CGGTAAGATA GCTATTGCTA TTTCATTGTT TCTTGCTCTT ATTATTGGGC 
GCCATTCTAT CGATAACGAT AAAGTAACAA AGAACGAGAA TAATAACCCG 

3 651 TTAACTCAAT TCTTGTGGGT TATCTCTCTG ATATTAGCGC ACAATTACCC 
AATTGAGTTA AGAACACCCA ATAGAGAGAC TATAATCGCG TGTTAATGGG 

3 701 TCTGATTTTG TTCAGGGCGT TCAGTTAATT CTCCCGTCTA ATGCGCTTCC 
AGACTAAAAC AAGTCCCGCA AGTCAATTAA GAGGGCAGAT TACGCGAAGG 

3 751 CTGTTTTTAT GTTATTCTCT CTGTAAAGGC TGCTATTTTC ATTTTTGACG 
GACAAAAATA CAATAAGAGA GACATTTCCG ACGATAAAAG TAAAAACTGC 

3 801 TTAAACAAAA AATCGTTTCT TATTTGGATT GGGATAAATA AATATGGCTG 
AATTTGTTTT TT AG C AAAGA AT AAAC CTAA CCCTATTTAT TTATACCGAC 

3 851 TTTATTTTGT AACTGGCAAA TTAGGCTCTG GAAAGACGCT CGTTAGCGTT 
AAATAAAACA TTGACCGTTT AATCCGAGAC CTTTCTGCGA GCAATCGCAA 

3 901 GGTAAGATTC AGGATAAAAT TGTAGCTGGG TGCAAAATAG CAACTAATCT 
CCATTCTAAG TCCTATTTTA ACATCGACCC ACGTTTTATC GTTGATTAGA 

3 951 TGATTTAAGG CTTCAAAACC TCCCGCAAGT CGGGAGGTTC GCTAAAACGC 

ACTAAATTCC GAAGTTTTGG AGGGCGTTCA GCCCTCCAAG CGATTTTGCG 

4 001 CTCGCGTTCT TAG AATAC CG GATAAGCCTT CTATTTCTGA TTTGCTTGCT 

GAGCGCAAGA ATCTTATGGC CTATTCGGAA GATAAAGACT AAACGAACGA 

4 051 ATTGGTCGTG GTAATGATTC CTACGACGAA AATAAAAACG GTTTGCTTGT 
TAACCAGCAC CATTACTAAG GATGCTGCTT TTATTTTTGC CAAACGAACA 

4101 TCTTGATGAA TGCGGTACTT GGTTTAATAC CCGTTCATGG AATGACAAGG 
AGAACTACTT ACGCCATGAA CCAAATTATG GGCAAGTACC TTACTGTTCC 
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4151 AAAG AC AG C C GATTATTGAT TGGTTTCTTC ATGCTCGTAA ATTGGGATGG 
TTTCTGTCGG CTAATAACTA ACCAAAGAAG TACGAGCATT TAACCCTACC 

4201 GATATTATTT TTCTTGTTCA GGATTTATCT ATTGTTGATA AACAGGCGCG 
CTATAATAAA AAGAACAAGT CCTAAATAGA TAACAACTAT TTGTCCGCGC 

4 251 TTCTGCATTA GCTGAACACG TTGTTTATTG TCGCCGTCTG GACAGAATTA 
AAGACGTAAT CGACTTGTGC AACAAATAAC AGCGGCAGAC CTGTCTTAAT 

4 301 CTTTACCCTT TGTCGGCACT TTATATTCTC TTGTTACTGG CTCAAAAATG 
GAAATGGGAA ACAGCCGTGA AATATAAGAG AACAATGACC GAGTTTTTAC 

4 3 51 CCTCTGCCTA AATTACATGT TGGTGTTGTT AAATATGGTG ATTCTCAATT 
GGAGACGGAT TTAATGTACA ACCACAACAA TTTATAC C AC TAAGAGTTAA 

44 01 AAGCCCTACT GTTGAGCGTT GGCTTTATAC TGGTAAGAAT TTATATAACG 
TTCGGGATGA CAACTCGCAA CCGAAATATG ACCATTCTTA AATATATTGC 

44 51 CATATGACAC TAAACAGGCT TTTTCCAGTA ATTATGATTC AGGTGTTTAT 
GTATACTGTG ATTTGTCCGA AAAAGGTCAT TAATACTAAG TCCACAAATA 

4 501 TCATATTTAA CCCCTTATTT ATCACACGGT CGGTATTTCA AACCATTAAA 
AGTATAAATT GGGGAATAAA TAGTGTGCCA GCCATAAAGT TTGGTAATTT 

4 551 TTTAGGTCAG AAGATGAAAT TAACTAAAAT ATATTTGAAA AAGTTTTCTC 
AAATCCAGTC TTCTACTTTA ATTGATTTTA TATAAACTTT TTCAAAAGAG 

4 601 GCGTTCTTTG TCTTGCGATA GGATTTGCAT CAGCATTTAC ATATAGTTAT 
CGCAAGAAAC AGAACGCTAT CCTAAACGTA GTCGTAAATG TATATCAATA 

4 651 ATAACCCAAC CTAAGCCGGA GGTTAAAAAG GTAGTCTCTC AGACCTATGA 
TATTGGGTTG GATTCGGCCT CCAATTTTTC CATCAGAGAG TCTGGATACT 

4701 TTTTGATAAA TTCACTATTG ACTCTTCTCA GCGTCTTAAT CTAAGCTATC 
AAAACTATTT AAGTGATAAC TGAGAAGAGT CGCAGAATTA GATTCGATAG 

4 751 GCTATGTTTT CAAGGATTCT AAGGGAAAAT TAATTAATAG CGACGATTTA 
CGATACAAAA GTTCCTAAGA TTCCCTTTTA ATTAATTATC GCTGCTAAAT 

4 801 CAGAAGCAAG GTTATTCCAT CACATATATT GATTTATGTA CTGTTTCAAT 
GTCTTCGTTC CAATAAGGTA GTGTATATAA CTAAATACAT GACAAAGTTA 

4 851 TAAAAAAGGT AATTCAAATG AAATTGTTAA ATGTAATTAA TTTTGTTTTC 
ATTTTTTCCA TTAAGTTTAC TTTAACAATT TACATTAATT AAAACAAAAG 

4 901 TTGATGTTTG TTTCATCATC TTCTTTTGCT CAAGTAATTG AAATGAATAA 
AACTACAAAC AAAGTAGTAG AAGAAAACGA GTTCATTAAC TTTACTTATT 



4951 



TTCGCCTCTG CGCGATTTCG TGACTTGGTA TTCAAAGCAA ACAGGTGAAT 
AAGCGGAGAC GCGCTAAAGC ACTGAACCAT AAGTTTCGTT TGTCCACTTA 
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5 001 CTGTTATTGT CTCACCTGAT GTTAAAGGTA CAGTGACTGT ATATTCCTCT 
GACAATAACA GAGTGGACTA CAATTTCCAT GTCACTGACA TATAAGGAGA 



5051 GACGTTAAGC CTGAAAATTT ACGCAATTTC TTTATCTCTG TTTTACGTGC 
CTGCAATTCG GACTTTTAAA TGCGTTAAAG AAATAGAGAC AAAATGCACG 



5101 TAATAATTTT GATATGGTTG GCTCAATTCC TTCCATAATT CAGAAATATA 
ATTATTAAAA CTATACCAAC CGAGTTAAGG AAGGTATTAA GTCTTTATAT 

5151 ACCCAAATAG TCAGGATTAT ATTGATGAAT TGCCATCATC TGATATTCAG 
TGGGTTTATC AGTCCTAATA TAACTACTTA ACGGTAGTAG ACTATAAGTC 

52 01 GAATATGATG ATAATTCCGC TCCTTCTGGT GGTTTCTTTG TTCCGCAAAA 

CTTATACTAC TATTAAGGCG AGGAAGACCA CCAAAGAAAC AAGGCGTTTT 

5251 TGATAATGTT ACTCAAACAT TTAAAATTAA TAACGTTCGC GCAAAGGATT 
ACTATTACAA TGAGTTTGTA AATTTTAATT ATTGCAAGCG CGTTTCCTAA 

53 01 TAATAAGGGT TGTAGAATTG TTTGTTAAAT CTAATACATC TAAATCCTCA 

ATTATTCCCA ACATCTTAAC AAACAATTTA GATTATGTAG ATTTAGGAGT 

53 51 AATGTATTAT CTGTTGATGG TTCTAACTTA TTAGTAGTTA GCGCCCCTAA 
TTACATAATA GACAACTACC AAGATTGAAT AAT CAT C AAT CGCGGGGATT 



54 01 AGATATTTTA GATAACCTTC 
TCTATAAAAT CTATTGGAAG 

54 51 C TG A CC AG AT ATTGATTGAA 
GACTGGTCTA TAACTAACTT 

5 501 GCTTTAGATT TTTCCTTTGC 
CGAAATCTAA AAAGGAAACG 

5551 TGGTGTTAAT ACTGACCGTC 
ACCACAATTA TGACTGGCAG 

5601 CGTTCGGTAT TTTTAACGGC 
G C AAG C CAT A AAAATTGCCG 

5651 AAGACTAATA GCCATTCAAA 
TTCTGATTAT CGGTAAGTTT 

5701 TTCAGGTCAG AAGGGTTCTA 
AAGTCCAGTC TTCCCAAGAT 

5751 CTGGTCGTGT AACTGGTGAA 
GACCAGCACA TTGACCACTT 



CGCAATTTCT TTCTACTGTT GATTTGCCAA 
GCGTTAAAGA AAGATGACAA CTAAACGGTT 

GGATTAATTT TCGAGGTTCA G C AAGGTGAT 
C C T AATT AAA AGCTCCAAGT CGTTCCACTA 

TGCTGGCTCT CAGCGCGGCA CTGTTGCTGG 
ACG AC CG AG A GTCGCGCCGT GACAACGACC 

TAACCTCTGT TTTATCTTCT GCGGGTGGTT 
ATTGGAGACA AAATAGAAGA CGCCCACCAA 

GATGTTTTAG GGCTATCAGT TCGCGCATTA 
CTACAAAATC CCGATAGTCA AGCGCGTAAT 

AATATTGTCT GTGCCTCGTA TTCTTACGCT 
TTATAACAGA C ACGG AG CAT AAGAATGCGA 

TTTCTGTTGG CCAGAATGTC CCTTTTATTA 
AAAGACAACC GGTCTTACAG GGAAAATAAT 

TCTGCCAATG TAAATAATCC ATTTCAGACG 
AGACGGTTAC ATTTATTAGG TAAAGTCTGC 



5801 



GTTGAGCGTC AAAATGTTGG TATTTCTATG AGTGTTTTTC CCGTTGCAAT 
CAACTCGCAG TTTTACAACC ATAAAGATAC TCACAAAAAG GGCAACGTTA 
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Figure 31 ^ 

5851 GGCTGGCGGT AATATTGTTT TAGATATAAC CAGTAAGGCC GATAGTTTGA 
CCGACCGCCA TTATAACAAA ATCTATATTG GTCATTCCGG CTATCAAACT 

5901 GTTCTTCTAC TCAGGCAAGT GATGTTATTA CTAATCAAAG AAGTATTGCG 
CAAGAAGATG AGTCCGTTCA CTACAATAAT GATTAGTTTC TTCATAACGC 

5951 ACAACGGTTA ATTTGCGTGA TGGTCAGACT CTTTTGCTCG GTGGCCTCAC 
TGTTGCCAAT TAAACGCACT ACCAGTCTGA GAAAACGAGC CACCGGAGTG 

6001 TGATTACAAA AACACTTCTC AAGATTCTGG TGTGCCGTTC CTGTCTAAAA 
ACTAATGTTT TTGTGAAGAG TTCTAAGACC ACACGGCAAG GACAGATTTT 

60 51 TCCCTTTAAT CGGCCTCCTG TTTAGCTCCC GTTCTGATTC TAACGAGGAA 
AGGGAAATTA GCCGGAGGAC AAATCGAGGG CAAGACTAAG ATTGCTCCTT 

6101 AGCACGTTGT ACGTGCTCGT CAAAGCAACC ATAGTACGCG CCCTGTAGCG 
TCGTGCAACA TGCACGAGCA GTTTCGTTGG TATCATGCGC GGGACATCGC 

6151 GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT GACCGCTACA 
CGCGTAATTC GCGCCGCCCA CACCACCAAT GCGCGTCGCA CTGGCGATGT 

6201 CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT 
GAACGGTCGC GGGATCGCGG GCGAGGAAAG CGAAAGAAGG GAAGGAAAGA 

BamHI 



62 51 CGCCACGTTC TCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGATCCCTT 

GCGGTGCAAG AGGCCGAAAG GGGCAGTTCG AGATTTAGCC CCCTAGGGAA 

6 301 TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCTCCA AAAACTTGAT 
ATCCCAAGGC TAAATCACGA AATGCCGTGG AGCTGGAGGT TTTTGAACTA 

63 51 TTGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG 

AACCCACTAC CAAGTGCATC ACCCGGTAGC GGGACTATCT GCCAAAAAGC 

64 01 CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA 

GGGAAACTGC AACCTCAGGT GCAAGAAATT ATCACCTGAG AACAAGGTTT 

64 51 CTGGAACAAC ACTCACAACT AACTCGGCCT ATTCTTTTGA TTTATAAGGA 

GACCTTGTTG TGAGTGTTGA TTGAGCCGGA TAAGAAAACT AAATATTCCT 

6501 TTTTTGTCAT TTTCTGCTTA CTGGTTAAAA AATAAGCTGA TTTAACAAAT 
AAAAACAGTA AAAGACGAAT GACCAATTTT TTATTCGACT AAATTGTTTA 

65 51 ATTTAACGCG AAATTTAACA AAACATTAAC GTTTACAATT TAAATATTTG 

TAAATTGCGC TTTAAATTGT TTTGTAATTG CAAATGTTAA ATTTATAAAC 



6601 



CTTATACAAT CATCCTGTTT TTGGGGCTTT TCTGATTATC AACCGGGGTA 
GAATATGTTA GTAGGACAAA AACCCCGAAA AGACTAATAG TTGGCCCCAT 
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Figure 3J 

6651 CATATGATTG 
GTATACTAAC 

67 01 TTGCTCCAGA 
AACGAGGTCT 

67 51 AAATAGCTAC 
TTTATCGATG 

6801 CATATTGACG 
GTATAACTGC 

6851 TTTGCCTACT 
AAACGGATGA 

6901 AAAATTTTTA 
TTTTAAAAAT 

6951 CAGGGTCATA 
GTCCCAGTAT 

7001 TTTATTGCTT 
AAATAACGAA 
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ACATGCTAGT TTTACGATTA 
TGTACGATCA AAATGCTAAT 

CTTTCAGGTA ATGACCTGAT 
GAAAGTCCAT TACTGGACTA 

CCTCTCCGGC ATGAATTTAT 
GGAGAGGCCG TACTTAAATA 

GTGATTTGAC TGTCTCCGGC 
CACTAAACTG ACAGAGGCCG 

CATTACTCCG GCATTGCATT 
GTAATGAGGC CGTAACGTAA 

TCCCTGCGTT GAAATTAAGG 
AGGGACGCAA CTTTAATTCC 

ATGTTTTTGG TACAACCGAT 
TACAAAAACC ATGTTGGCTA 

AATTTTGCTA ACTCTCTGCC 
TTAAAACGAT TGAGAGACGG 



Clal 



CCGTTCATCG ATTCTCTTGT 
GGCAAGTAGC TAAGAGAACA 

AGCCTTTGTA GACCTCTCAA 
TCGGAAACAT CTGGAGAGTT 

CAGCTAGAAC GGTTGAATAT 
GTCGATCTTG CCAACTTATA 

CTTTCTCACC CGTTTGAATC 
GAAAGAGTGG G C AAACTT AG 

TAAAATATAT GAGGGTTCTA 
ATTTTATATA CTCCCAAGAT 

CTTCACCAGC AAAAGTATTA 
GAAGTGGTCG TTTTCATAAT 

TTAGCTTTAT GCTCTGAGGC 
AATCGAAATA CGAGACTCCG 

TTGCTTGTAC GATTTATTGG 
AACGAACATG CTAAATAACC 



7051 



ATGTT 
TACAA 
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cia i (6645^ Figure 4A 

Eco R I (6389) 
BamH I (6383)- 

: i ! 
i - * 

FLAG ■ \\ \ : 
cmpA \ \ \\ s : : 



RNAstart \\\ ;\ 




R4B(f1phgge)IRregcn 7 \ \ \ gflrick 

C37B3>T T>T>lle(IR2frcm R408miss'rgO ^ \ fieri 



\\ G3225>A (R4CB IR1 mut missing!) 
C/o I (3280) 

G3329>T, W >Leu (PCR mtt . ret reecfed fcr IR) 
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Figure 4B Hindi n 

1 AGCTTCGAGA AATTCACCTC GAAAGCAAGC TGATAAACCG ATACAATTAA 
TCGAAGCTCT TTAAGTGGAG CTTTCGTTCG ACTATTTGGC TATGTTAATT 

51 AGGCTCCTTT TGGAGCCTTT TTTTTTGGAG AATTAATTCA ATCATGCCAG 
TCCGAGGAAA ACCTCGGAAA AAAAAACCTC TTAATTAAGT TAGTACGGTC 

101 TTCTTTTGGG TATTCCGTTA TTATTGCGTT TCCTCGGTTT CCTTCTGGTA 
AAGAAAACCC ATAAGGCAAT AATAACGCAA AGGAGCCAAA GGAAGACCAT 

151 ACTTTGTTCG GCTATCTGCT TACTTTCCTT AAAAAGGGCT TCGGTAAGAT 
TGAAACAAGC CGATAGACGA ATGAAAGGAA TTTTTCCCGA AGCCATTCTA 

2 01 AGCTATTGCT ATTTCATTGT TTCTTGCTCT TATTATTGGG CTTAACTCAA 
TCGATAACGA TAAAGTAACA AAGAACGAGA ATAATAACCC GAATTGAGTT 

2 51 TTCTTGTGGG TTATCTCTCT GATATTAGCG CACAATTACC CTCTGATTTT 

AAGAACACCC AATAGAGAGA CTATAATCGC GTGTTAATGG GAGACTAAAA 

3 01 GTTCAGGGCG TTCAGTTAAT TCTCCCGTCT AATGCGCTTC CCTGTTTTTA 

CAAGTCCCGC AAGTCAATTA AGAGGGCAGA TTACGCGAAG G G AC AAAAAT 

3 51 TGTTATTCTC TCTGTAAAGG CTGCTATTTT CATTTTTGAC GTTAAACAAA 

ACAATAAGAG AGACATTTCC G AC GAT AAAA GTAAAAACTG CAATTTGTTT 

4 01 AAATCGTTTC TTATTTGGAT TGGGATAAAT AAATATGGCT GTTTATTTTG 

TTTAG C AAAG AATAAACCTA ACCCTATTTA TTTAT AC CG A CAAATAAAAC 

4 51 TAACTGGCAA ATTAGGCTCT GGAAAGACGC TCGTTAGCGT TGGTAAGATT 
ATTGACCGTT TAATCCGAGA CCTTTCTGCG AGCAATCGCA ACCATTCTAA 

501 CAGGATAAAA TTGTAGCTGG GTGCAAAATA GCAACTAATC TTGATTTAAG 
GTCCTATTTT AACATCGACC CACGTTTTAT CGTTGATTAG AACTAAATTC 

551 GCTTCAAAAC CTCCCGCAAG TCGGGAGGTT CGCTAAAACG CCTCGCGTTC 
CGAAGTTTTG GAGGGCGTTC AGCCCTCCAA GCGATTTTGC GGAGCGCAAG 

601 TTAGAATACC GGATAAGCCT TCTATTTCTG ATTTGCTTGC TATTGGTCGT 
AATCTTATGG CCTATTCGGA AGATAAAGAC TAAACGAACG ATAACCAGCA 

651 GGTAATGATT CCTACGACGA AAATAAAAAC GGTTTGCTTG TTCTTGATGA 
CCATTACTAA GGATGCTGCT TTTATTTTTG CCAAACGAAC AAGAACTACT 

701 ATGCGGTACT TGGTTTAATA CCCGTTCATG GAATGACAAG G AAAG AC AG C 
TACGCCATGA ACCAAATTAT GGGCAAGTAC CTTACTGTTC CTTTCTGTCG 



751 



CGATTATTGA TTGGTTTCTT CATGCTCGTA AATTGGGATG GGATATTATT 
GCTAATAACT AA C C AAAG AA GTACGAGCAT TTAACCCTAC CCTATAATAA 
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801 TTTCTTGTTC AGGATTTATC TATTGTTGAT AAACAGGCGC GTTCTGCATT 
AAAGAACAAG TCCTAAATAG ATAACAACTA TTTGTCCGCG CAAGACGTAA 

8 51 AGCTGAACAC GTTGTTTATT GTCGCCGTCT GGACAGAATT ACTTTACCCT 
TCGACTTGTG CAACAAATAA CAGCGGCAGA CCTGTCTTAA TGAAATGGGA 

901 TTGTCGGCAC TTTATATTCT CTTGTTACTG GCTCAAAAAT GCCTCTGCCT 
AACAGCCGTG AAATATAAGA GAACAATGAC CGAGTTTTTA CGGAGACGGA 

951 AAATTACATG TTGGTGTTGT TAAATATGGT GATTCTCAAT TAAGCCCTAC 
TTTAATGTAC AACCACAACA ATTTATACCA CTAAGAGTTA ATTCGGGATG 

1001 TGTTGAGCGT TGGCTTTATA CTGGTAAGAA TTTATATAAC GCATATGACA 
ACAACTCGCA ACCGAAATAT GACCATTCTT AAATATATTG CGTATACTGT 

1051 CTAAACAGGC TTTTTCCAGT AATTATGATT CAGGTGTTTA TTCATATTTA 
GATTTGTCCG AAAAAGGTCA TTAATACTAA GTCCACAAAT AAGTATAAAT 

1101 ACCCCTTATT TATCACACGG TCGGTATTTC AAACCATTAA ATTTAGGTCA 
TGGGGAATAA ATAGTGTGCC AGCCATAAAG TTTGGTAATT TAAATCCAGT 

1151 GAAGATGAAA TTAACTAAAA TATATTTGAA AAAGTTTTCT CGCGTTCTTT 
CTTCTACTTT AATTGATTTT ATATAAACTT TTTCAAAAGA GCGCAAGAAA 

12 01 GTCTTGCGAT AGGATTTGCA TCAGCATTTA CATATAGTTA TATAACCCAA 
CAGAACGCTA TCCTAAACGT AGTCGTAAAT GTATATCAAT ATATTGGGTT 

12 51 CCTAAGCCGG AGGTTAAAAA GGTAGTCTCT CAGACCTATG ATTTTGATAA 

GGATTCGGCC TCCAATTTTT CCATCAGAGA GTCTGGATAC TAAAACTATT 

13 01 ATTCACTATT GACTCTTCTC AGCGTCTTAA TCTAAGCTAT CGCTATGTTT 

TAAGTGATAA CTGAGAAGAG TCGCAGAATT AGATTCGATA GCGATACAAA 

13 51 TCAAGGATTC TAAGGGAAAA TTAATTAATA GCGACGATTT ACAGAAGCAA 

AGTTCCTAAG ATTCCCTTTT AATTAATTAT CGCTGCTAAA TGTCTTCGTT 

14 01 GGTTATTCCA TC AC AT A TAT TGATTTATGT ACTGTTTCAA TTAAAAAAGG 

CCAATAAGGT AGTGTATATA ACTAAATACA TGACAAAGTT AATTTTTTCC 

1451 TAATTCAAAT GAAATTGTTA AATGTAATTA ATTTTGTTTT CTTGATGTTT 
ATTAAGTTTA CTTTAACAAT TTACATTAAT TAAAACAAAA GAACTACAAA 

1501 GTTTCATCAT CTTCTTTTGC TCAAGTAATT GAAATGAATA ATTCGCCTCT 
CAAAGTAGTA GAAGAAAACG AGTTCATTAA CTTTACTTAT TAAGCGGAGA 

15 51 GCGCGATTTC GTGACTTGGT ATTCAAAGCA AACAGGTGAA TCTGTTATTG 

CGCGCTAAAG CACTGAACCA TAAGTTTCGT TTGTCCACTT AGACAATAAC 

1601 TCTCACCTGA TGTTAAAGGT ACAGTGACTG TATATTCCTC TGACGTTAAG 
AGAGTGGACT ACAATTTCCA TGTCACTGAC ATATAAGGAG ACTGCAATTC 
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16 51 CCTGAAAATT TACGCAATTT CTTTATCTCT GTTTTACGTG CTAATAATTT 
GGACTTTTAA ATGCGTTAAA GAAATAGAGA CAAAATGCAC GATTATTAAA 

1701 TGATATGGTT GGCTCTAATC CTTC C ATAAT TCAGAAATAT AACCCAAATA 
ACTATACCAA CCGAGATTAG GAAGGTATTA AGTCTTTATA TTGGGTTTAT 

1751 GTCAGGATTA TATTGATGAA TTGCCATCAT CTGATATTCA GGAATATGAT 
CAGTCCTAAT ATAACTACTT AACGGTAGTA GACTATAAGT CCTTATACTA 

1801 GATAATTCCG CTCCTTCTGG TGGTTTCTTT GTTCCGCAAA ATGATAATGT 
CTATTAAGGC GAGGAAGACC ACCAAAGAAA CAAGGCGTTT TACTATTACA 

18 51 TACTCAAACA TTTAAAATTA ATAACGTTCG CGCAAAGGAT TTAATAAGGG 
ATGAGTTTGT AAATTTTAAT TATTGCAAGC GCGTTTCCTA AATTATTCCC 

1901 TTGTAGAATT GTTTGTTAAA TCTAATACAT CTAAATCCTC AAATGTATTA 
AACATCTTAA CAAACAATTT AGATTATGTA GATTTAGGAG TTTACATAAT 

1951 TCTGTTGATG GTTCTAACTT ATTAGTAGTT AGCGCCCCTA AAGATATTTT 
AGACAACTAC CAAGATTGAA TAATCATCAA TCGCGGGGAT TTCTATAAAA 

20 01 AG ATAAC CTT CCGCAATTTC TTTCTACTGT TGATTTGCCA ACTGACCAGA 
TCTATTGGAA GGCGTTAAAG AAAGATGACA ACTAAACGGT TGACTGGTCT 

2 051 TATTGATTGA AGGATTAATT TTCGAGGTTC AGCAAGGTGA TGCTTTAGAT 
ATAACTAACT TCCTAATTAA AAGCTCCAAG TCGTTCCACT ACGAAATCTA 

2101 TTTTCCTTTG CTGCTGGCTC TCAGCGCGGC ACTGTTGCTG GTGGTGTTAA 
AAAAGGAAAC GACGACCGAG AGTCGCGCCG TGACAACGAC CACCACAATT 

2151 TACTGACCGT CTAACCTCTG TTTTATCTTC TGCGGGTGGT TCGTTCGGTA 
ATGACTGGCA GATTGGAGAC AAAATAGAAG ACGCCCACCA AGCAAGCCAT 

22 01 TTTTTAACGG CGATGTTTTA GGGCTATCAG TTCGCGCATT AAAGACTAAT 
AAAAATTGCC GCTACAAAAT CCCGATAGTC AAGCGCGTAA TTTCTGATTA 

22 51 AG C C ATT C AA AAATATTGTC TGTGCCTCGT ATTCTTACGC TTTCAGGTCA 

TCGGTAAGTT TTTATAACAG ACACGGAGCA TAAGAATGCG AAAGTCCAGT 

23 01 GAAGGGTTCT ATTTCTGTTG GCCAGAATGT CCCTTTTATT ACTGGTCGTG 

CTTCCCAAGA TAAAGACAAC CGGTCTTACA GGGAAAATAA TGACCAGCAC 

23 51 TAACTGGTGA ATCTGCCAAT GTAAATAATC CATTTCAGAC AATTGAGCGT 

ATTGACCACT TAGACGGTTA CATTTATTAG GTAAAGTCTG TTAACTCGCA 

24 01 CAAAATGTTG GTATTTCTAT GAGTGTTTTT CCCGTTGCAA TGGCTGGCGG 

GTTTTACAAC CATAAAGATA CTCACAAAAA GGGCAACGTT ACCGACCGCC 

24 51 TAATATTGTT TTAGATATAA CCAGTAAGGC CGATAGTTTG AGTTCTTCTA 
ATTATAACAA AATCTATATT GGTCATTCCG GCTATCAAAC TCAAGAAGAT 
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2 501 CTCAGGCAAG TGATGTTATT ACTAATCAAA GAAGTATTGC GACAACGGTT 
GAGTCCGTTC ACTACAATAA TGATTAGTTT CTTCATAACG CTGTTGCCAA 

2 551 AATTTGCGTG ATGGTCAGAC TCTTTTGCTC GGTGGCCTCA CTGATTACAA 
TTAAACGCAC TACCAGTCTG AGAAAACGAG CCACCGGAGT GACTAATGTT 

2 601 AAACACTTCT CAAGATTCTG GTGTGCCGTT CCTGTCTAAA ATCCCTTTAA 
TTTGTGAAGA GTTCTAAGAC CACACGGCAA GGACAGATTT TAGGGAAATT 

2 651 TCGGCCTCCT GTTTAGCTCC CGTTCTGATT CTAACGAGGA AAGCACGTTG 
AGCCGGAGGA CAAATCGAGG GCAAGACTAA GATTGCTCCT TTCGTGCAAC 

2 701 TACGTGCTCG TCAAAGCAAC CAT AGTACG C GCCCTGTAGC GGCGCATTAA 
ATGCACGAGC AGTTTCGTTG GTATCATGCG CGGGACATCG CCGCGTAATT 

2 751 GCGCGGCGGG TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC 
CGCGCCGCCC ACACCACCAA TGCGCGTCGC ACTGGCGATG TGAACGGTCG 

28 01 GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC CCTTCCTTTC TCGCCACGTT 
CGGGATCGCG GGCGAGGAAA GCGAAAGAAG GGAAGGAAAG AGCGGTGCAA 

BamHI 



2 851 CTCCGGCTTT CCCCGTCAAG CTCTAAATCG GGGGATCCCT TTAGGGTTCC 
GAGGCCGAAA GGGGCAGTTC GAGATTTAGC CCCCTAGGGA AATCCCAAGG 

2 901 GATTTAGTGC TTTACGGCAC CTCGACCTCC AAAAACTTGA TTTGGGTGAT 
CTAAATCACG AAATGCCGTG GAGCTGGAGG TTTTTGAACT AAACCCACTA 

2 951 GGTTCACGTA GTGGGCCATC GCCCTAATAG ACGGTTTTTC GCCCTTTGAC 

CCAAGTGCAT CACCCGGTAG CGGGATTATC TGCCAAAAAG CGGGAAACTG 

3 001 GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA ACTGGAACAA 

CAACCTCAGG TGCAAGAAAT TATCACCTGA GAACAAGGTT TGACCTTGTT 

3051 CACTCAACCC TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGCCG 
GTGAGTTGGG ATAGAGCCAG ATAAGAAAAC TAAATATTCC CTAAAACGGC 

3101 ATTTCGGCCT ATTGGTTAAA AAATGAGCTG ATTTAACAAA AATTTAACGC 
TAAAGCCGGA TAACCAATTT TTTACTCGAC TAAATTGTTT TTAAATTGCG 

3151 GAATTTTAAC AAAATATTAA CGTTTACAAT TTAAATATTT GCTTATACAA 
CTTAAAATTG TTTTATAATT GCAAATGTTA AATTTATAAA CGAATATGTT 

32 01 TCTTCCTGTT TTTGGGGCTT TTCTGATTAT CAACCGGGGT ACATATGATT 
AGAAGGACAA AAACCCCGAA AAGACTAATA GTTGGCCCCA TGTATACTAA 

Clal 



32 51 GACATGCTAG TTTTACGATT ACCGTTCATC GATTCTCTTG TTTGCTCCAG 
CTGTACGATC AAAATGCTAA TGGCAAGTAG CTAAGAGAAC AAACGAGGTC 
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33 01 ACTCTCAGGC AATGACCTGA TAGCCTTTTT AGACCTCTCA AAAATAGCTA 
TGAGAGTCCG TTACTGGACT ATCGGAAAAA TCTGGAGAGT TTTTATCGAT 

33 51 CCCTCTCCGG CATGAATTTA TCAGCTAGAA CGGTTGAATA TCATATTGAT 

GGGAGAGGCC GTACTTAAAT AGTCGATCTT GCCAACTTAT AGTATAACTA 

34 01 GGTGATTTGA CTGTCTCCGG CCTTTCTCAC CCGTTTGAAT CTTTACCTAC 

CCACTAAACT GACAGAGGCC GGAAAGAGTG GGCAAACTTA GAAATGGATG 

34 51 ACATTACTCA GGCATTGCAT TTAAAATATA TGAGGGTTCT AAAAATTTTT 
TGTAATGAGT CCGTAACGTA AATTTTATAT ACTCCCAAGA TTTTTAAAAA 

3 501 ATCCTTGCGT TGAAATAAAG GCTTCTCCCG CAAAAGTATT ACAGGGTCAT 
TAGGAACGCA ACTTTATTTC CGAAGAGGGC GTTTTCATAA TGTCCCAGTA 

3 551 AATGTTTTTG GTACAACCGA TTTAGCTTTA TGCTCTGAGG CTTTATTGCT 
TTACAAAAAC CATGTTGGCT AAATCGAAAT ACGAGACTCC GAAATAACGA 

3 601 TAATTTTGCT AATTCTTTGC CTTGCCTGTA TGATTTATTG GATGTTAACG 
ATTAAAACGA TTAAGAAACG GAACGGACAT ACTAAATAAC CTACAATTGC 

3 651 CTACTACTAT TAGTAGAATT GATGCCACCT TTTCAGCTCG CGCCCCAAAT 
GATGATGATA ATCATCTTAA CTACGGTGGA AAAGTCGAGC GCGGGGTTTA 

3 701 GAAAATATAG CTAAACAGGT TATTGACCAT TTGCGAAATG TATCTAATGG 
CTTTTATATC GATTTGTCCA ATAACTGGTA AACGCTTTAC AT AG ATT AC C 

3 7 51 TCAAACTAAA TCTACTCGTT CGCAGAATTG GGAATCAACT GTTACATGGA 
AGTTTGATTT AGATGAGCAA GCGTCTTAAC CCTTAGTTGA CAATGTACCT 

3 801 ATGAAACTTC CAGACACCGT ACTTTAGTTG CATATTTAAA ACATGTTGAG 
TACTTTGAAG GTCTGTGGCA TGAAATCAAC GTATAAATTT TGTACAACTC 

3 851 CTACAGCACC AGATCCAGCA ATTAAGCTCT AAGCCATCCG CAAAAATGAC 
GATGTCGTGG TCTAGGTCGT TAATTCGAGA TTCGGTAGGC GTTTTTACTG 

3 901 CTCTTATCAA AAGGAGCAAT TAAAGGTACT CTCTAATCCT GACCTGTTGG 
GAGAATAGTT TTCCTCGTTA ATTTCCATGA GAGATTAGGA CTGGACAACC 

3 951 AGTTTGCTTC CGGTCTGGTT CGCTTTGAAG CTCGAATTAA AACGCGATAT 

TCAAACGAAG GCCAGACCAA GCGAAACTTC GAGCTTAATT TTGCGCTATA 

4 001 TTGAAGTCTT TCGGGCTTCC TCTTAATCTT TTTGATGCAA TCCGCTTTGC 

AACTTCAGAA AGCCCGAAGG AGAATTAGAA AAACTACGTT AGGCGAAACG 

4 051 TTCTGACTAT AATAGTCAGG GTAAAGACCT GATTTTTGAT TTATGGTCAT 
AAGACTGATA TTATCAGTCC CATTTCTGGA CTAAAAACTA AATACCAGTA 

4101 TCTCGTTTTC TGAACTGTTT AAAGCATTTG AGGGGGATTC AATGAATATT 
AGAGCAAAAG ACTTGACAAA TTTCGTAAAC TCCCCCTAAG TTACTTATAA 
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4151 TATGACGATT CCGCAGTATT GGACGCTATC CAGTCTAAAC ATTTTACTAT 
ATACTGCTAA GGCGTCATAA CCTGCGATAG GTCAGATTTG TAAAATGATA 

4 201 TACCCCCTCT GGCAAAACTT CTTTTGCAAA AGCCTCTCGC TATTTTTGTT 
ATGGGGGAGA CCGTTTTGAA GAAAACGTTT TCGGAGAGCG ATAAAAACAA 

4 251 TTTATCGTCG TCTGGTAAAC GAGGGTTATG ATAGTGTTGC TCTTACTATG 
AAATAGCAGC AGACCATTTG CTCCCAATAC TATCACAACG AGAATGATAC 

4 301 CCTCGTAATT CCTTTTGGCG TTATGTATCT GCATTAGTTG AATGTGGTAT 
GGAGCATTAA GGAAAACCGC AATACATAGA CGTAATCAAC TT AC AC CAT A 

4 351 TCCTAAATCT CAACTGATGA ATCTTTCTAC CTGTAATAAT GTTGTTCCGT 
AGGATTTAGA GTTGACTACT TAGAAAGATG GACATTATTA CAACAAGGCA 

4 4 01 TAGTTCGTTT TATTAACGTA GATTTTTCTT CCCAACGTCC TGACTGGTAT 
ATCAAGCAAA ATAATTGCAT CTAAAAAGAA GGGTTGCAGG ACTGACCATA 

4 4 51 AATGAGCCAG TTCTTAAAAT CGCATAAGGT AATTCACAAT GATTAAAGTT 
TTACTCGGTC AAGAATTTTA GCGTATTCCA TTAAGTGTTA CTAATTTCAA 

4 501 GAAATTAAAC CATCTCAAGC GCAATTCACT ACCCGTTCTG GTGTTTCTCG 
CTTTAATTTG GTAGAGTTCG CGTTAAGTGA TGGGCAAGAC CACAAAGAGC 

4 551 TCAGGGCAAG CCTTATTCAC TGAATGAGCA GCTTTGTTAC GTTGATTTGG 
AGTCCCGTTC GGAATAAGTG ACTTACTCGT CGAAACAATG CAACTAAACC 

4 601 GTAATGAATA TCCGGTGCTT GTCAAGATTA CTCTTGATGA AGGTCAGCCA 
CATTACTTAT AGGCCACGAA CAGTTCTAAT GAGAACTACT TCCAGTCGGT 

4 651 GCCTATGCGC CTGGTCTGTA CACCGTGCAT CTGTCCTCGT TCAAAGTTGG 
CGGATACGCG G AC CAGAC AT GTGGCACGTA GACAGGAGCA AGTTTCAACC 

4 701 TCAGTTCGGT TCTCTTATGA TTGACCGTCT GCGCCTCGTT CCGGCTAAGT 
AGTCAAGCCA AGAGAATACT AACTGGCAGA CGCGGAGCAA GG CCGATTC A 

4751 AACATGGAGC AGGTCGCGGA TTTCGACACA ATTTATCAGG CGATGATACA 
TTGTACCTCG TCCAGCGCCT AAAGCTGTGT TAAATAGTCC GCTACTATGT 

4 801 AATCTCCGTT GTACTTTGTT TCGCGCTTGG TATAATCGCT GGGGGTCAAA 
TT AG AGG C AA CATGAAACAA AGCGCGAACC ATATTAGCGA CCCCCAGTTT 

4851 GATGAGTGTT TTAGTGTATT CTTTCGCCTC TTTCGTTTTA GGTTGGTGCC 
CTACTCACAA AATCACATAA GAAAGCGGAG AAAGCAAAAT CCAACCACGG 

4 901 TTCGTAGTGG CATTACGTAT TTTACCCGTT TAATGGAAAC TTCCTCATGC 
AAGCATCACC GTAATGCATA AAATGGGCAA ATTACCTTTG AAGGAGTACG 

4 951 GTAAGTCTTT AGTCCTCAAA GCCTCCGTAG CCGTTGCTAC CCTCGTTCCG 
CATTCAGAAA TCAGGAGTTT CGGAGGCATC GGCAACGATG GGAGCAAGGC 
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50 01 ATGCTGTCTT TCGCTGCTGA GGGTGACGAT CCCGCAAAAG CGGCCTTTGA 
TACGACAGAA AGCGACGACT CCCACTGCTA GGGCGTTTTC GCCGGAAACT 

50 51 CTCCCTGCAA GCCTCAGCGA CCGAATATAT CGGTTATGCG TGGGCGATGG 
GAGGGACGTT CGGAGTCGCT GGCTTATATA GCCAATACGC ACCCGCTACC 

5101 TTGTTGTCAT TGTCGGCGCA ACTATCGGTA TCAAGCTGTT TAAGAAATTC 
AACAACAGTA ACAGCCGCGT TGATAGCCAT AGTTCGACAA ATTCTTTAAG 

5151 ACCTCGAAAG CAAGCTGATA AAGGAGGTTT CTCGATCGAG ACGTTGGGTG 
TGGAGCTTTC GTTCGACTAT TTCCTCCAAA GAGCTAGCTC TGCAACCCAC 

52 01 AGGTTCCAAC TTTCACCATA ATGAAATAAG ATCACTACCG GGCGTATTTT 
TCCAAGGTTG AAAGTGGTAT TACTTTATTC TAGTGATGGC CCGCATAAAA 

52 51 TTGAGTTATC GAGATTTTCA GGAGCTAAGG AAGCTAAAAT GGAGAAAAAA 

AACTCAATAG CTCTAAAAGT CCTCGATTCC TTCGATTTTA CCTCTTTTTT 

53 01 ATCACTGGAT ATACCACCGT TGATATATCC CAATGGCATC GTAAAGAACA 

TAGTGACCTA TATGGTGG C A ACTATATAGG GTTACCGTAG CATTTCTTGT 

53 51 TTTTGAGGCA TTTCAGTCAG TTGCTCAATG TACCTATAAC CAGACCGTTC 

AAAACTCCGT AAAGTCAGTC AACGAGTTAC ATGGATATTG GTCTGGCAAG 

54 01 AGCTGGATAT TACGGCCTTT TTAAAGACCG TAAAGAAAAA TAAGCACAAG 

TCGACCTATA ATGCCGGAAA AATTTCTGGC ATTTCTTTTT ATTCGTGTTC 

54 51 TTTTATCCGG CCTTTATTCA CATTCTTGCC CGCCTGATGA ATGCTCATCC 
AAAATAGGCC GGAAATAAGT GTAAGAACGG GCGGACTACT TACGAGTAGG 

5501 GGAGTTCCGT ATGGCAATGA AAGACGGTGA GCTGGTGATA TGGGATAGTG 
CCTCAAGGCA TACCGTTACT TTCTGCCACT CGACCACTAT ACCCTATCAC 

5551 TTCACCCTTG TTACACCGTT TTCCATGAGC AAACTGAAAC GTTTTCATCG 
AAGTGGGAAC AATGTGGCAA AAGGTACTCG TTTGACTTTG CAAAAGTAGC 

5601 CTCTGGAGTG AATACCACGA CGATTTCCGG CAGTTTCTAC ACATATATTC 
GAGACCTCAC TTATGGTGCT GCTAAAGGCC GTCAAAGATG TGTATATAAG 

5651 GCAAGATGTG GCGTGTTACG GTGAAAACCT GGCCTATTTC CCTAAAGGGT 
CGTTCTACAC CGCACAATGC CACTTTTGGA CCGGATAAAG GGATTTCCCA 

5701 TTATTGAGAA TATGTTTTTC GTCTCAGCCA ATCCCTGGGT GAGTTTCACC 
AATAACTCTT ATACAAAAAG CAGAGTCGGT TAGGGACCCA CTCAAAGTGG 

57 51 AGTTTTGATT TAAA CG TAG C CAATATGGAC AACTTCTTCG CCCCCGTTTT 
TCAAAACTAA ATTTGCATCG GTTATACCTG TTGAAGAAGC GGGGGCAAAA 

5801 CACTATGGGC AAATATTATA CGCAAGGCGA CAAGGTGCTG ATGCCGCTGG 
GTGATACCCG TTTATAATAT GCGTTCCGCT GTTCCACGAC TACGGCGACC 
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5 851 CGATTCAGGT TCATCATGCC GTTTGTGATG GCTTCCATGT CGGCAGAATG 
GCTAAGTCCA AGTAGTACGG CAAACACTAC CGAAGGTACA GCCGTCTTAC 

5 901 CTTAATGAAT TACAACAGTA CTGCGATGAG TGGCAGGGCG GGGCGTAATT 
GAATTACTTA ATGTTGTCAT GACGCTACTC ACCGTCCCGC CCCGCATTAA 

5 951 TTTTTAAGGC AGTTATTGGT GCCCTTAAAC GCCTGGTGCT AGCCTGAGGC 

AAAAATTCCG TCAATAACCA CGGGAATTTG CGGACCACGA TCGGACTCCG 

6 001 CAGTTTGCTC AGGCTCTCCC CGTGGAGGTA ATAATTGCTC GACCGATAAA 

GTCAAACGAG TCCGAGAGGG GCACCTCCAT TATTAACGAG CTGGCTATTT 

6 051 AGCGGCTTCC TGACAGGAGG CCGTTTTGTT TTGCAGCCCA CCTCAACGCA 
TCGCCGAAGG ACTGTCCTCC GGCAAAACAA AACGTCGGGT GGAGTTGCGT 

6101 ATTAATGTGA GTTAGCTCAC TCATTAGGCA CCCCAGGCTT TACACTTTAT 
TAATTACACT CAATCGAGTG AGTAATCCGT GGGGTCCGAA ATGTGAAATA 

6151 GCTTCCGGCT CGTATGTTGT GTGGAATTGT GAGCGGATAA CAATTTCACA 
CGAAGGCCGA GCATACAACA CACCTTAACA CTCGCCTATT GTTAAAGTGT 

62 01 CAGGAAACAG CTATGACCAT GATTACGAAT TTCTAGATAA CGAGGGCAAA 
GTCCTTTGTC GATACTGGTA CTAATGCTTA AAGATCTATT GCTCCCGTTT 

62 51 AAATGAAAAA GACAGCTATC GCGATTGCAG TGGCACTGGC TGGTTTCGCT 

TTTACTTTTT CTGTCGATAG CGCTAACGTC ACCGTGACCG ACCAAAGCGA 

63 01 ACCGTAGCGC AGGCCGACTA CAAAGATGTC GACTGTATTG TTTATCATGC 

TGGCATCGCG TCCGGCTGAT GTTTCTACAG CTGACATAAC AAATAGTACG 

BamHI EcoRI 



63 51 TCATTATCTT GTTG CTAAGT GTGGTGGTGG AGGATCCGAA TTCAATGCTG 

AGTAATAGAA CAACGATTCA CACCACCACC TCCTAGGCTT AAGTTACGAC 

64 01 GCGGCGGCTC TGGTGGTGGT TCTGGTGGCG GCTCTGAGGG TGGTGGCTCT 

CGCCGCCGAG ACCACCACCA AGACCACCGC CGAGACTCCC ACCACCGAGA 

64 51 GAGGGTGGCG GTTCTGAGGG TGGCGGCTCT GAGGGAGGCG GTTCCGGTGG 

CTCCCACCGC CAAGACTCCC ACCGCCGAGA CTCCCTCCGC CAAGGCCACC 

65 01 TGGCTCTGGT TCCGGTGATT TTGATTATGA AAAGATGGCA AACGCTAATA 

ACCGAGACCA AGGCCACTAA AACTAATACT TTTCTACCGT TTGCGATTAT 

65 51 AGGGGGCTAT GACCGAAAAT GCCGATGAAA ACGCGCTACA GTCTGACGCT 

TCCCCCGATA CTGGCTTTTA CGGCTACTTT TGCGCGATGT CAGACTGCGA 
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66 01 AAAGGCAAAC TTGATTCTGT CGCTACTGAT TACGGTGCTG CTATCGATGG 

TTTCCGTTTG AACTAAGACA GCGATGACTA ATGCCACGAC GATAGCTACC 

6651 TTTCATTGGT GACGTTTCCG GCCTTGCTAA TGGTAATGGT GCTACTGGTG 
AAAGTAACCA CTGCAAAGGC CGGAACGATT ACCATTACCA CGATGACCAC 

67 01 ATTTTGCTGG CTCTAATTCC CAAATGGCTC AAGTCGGTGA CGGTGATAAT 

TAAAACGACC GAGATTAAGG GTTTACCGAG TTCAGCCACT GCCACTATTA 

67 51 TCACCTTTAA TGAATAATTT CCGTCAATAT TTACCTTCCC TCCCTCAATC 

AGTGGAAATT ACTTATTAAA GGCAGTTATA AATGGAAGGG AGGGAGTTAG 

68 01 GGTTGAATGT CGCCCTTTTG TCTTTGGCGC TGGTAAACCA TATGAATTTT 

CCAACTTACA GCGGGAAAAC AG AAACCG CG ACCATTTGGT ATACTTAAAA 

6851 CTATTGATTG TGACAAAATA AACTTATTCC GTGGTGTCTT TGCGTTTCTT 
GATAACTAAC ACTGTTTTAT TTGAATAAGG CACCACAGAA ACGCAAAGAA 

6901 TTATATGTTG CCACCTTTAT GTATGTATTT TCTACGTTTG CTAACATACT 
AATATACAAC GGTGGAAATA CATACATAAA AGATGCAAAC GATTGTATGA 

Hindlll 



6951 



GCGTAATAAG GAGTCTTGAT A 
CGCATTATTC CTCAGAACTA T 
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